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Preface 


The Canadian Number Theory Association (CNTA) is an informal organization of 
Canadian number theorists, founded in 1987 through the initiative and efforts of the 
late Richard Mollin of the University of Calgary. The following is a complete list of 
the CNTA conferences. 


CNTA I Banff Center, Banff, Alberta 1988 
CNTA II University of British Columbia 1989 


CNTA IIIT = Queen’s University 1991 
CNTAIV Dalhousie University 1994 
CNTA V Carleton University 1996 
CNTA VI University of Manitoba 1999 
CNTA VII Université de Montréal 2002 
CNTA VII University of Toronto 2004 
CNTA IX — University of British Columbia 2006 
CNTA X University of Waterloo 2008 
CNTA XI Acadia University 2010 
CNTA XII University of Lethbridge 2012 
CNTA XIII Carleton University 2014 


This volume contains the proceedings of the thirteenth CNTA conference, which 
was held at Carleton University, Ottawa, Ontario from June 16 to 20, 2014. There 
were 154 participants at the conference from across all continents. The scientific 
program of the conference consisted of one public lecture, one Ribenboim Prize 
lecture, ten plenary talks, 22 invited talks and 65 contributed talks, for a total of 99 
talks. 

The Conference Organizing Committee comprised Ayse Alaca (Carleton), 
Saban Alaca (Carleton, main organizer), Paul Mezo (Carleton), Damien Roy 
(Ottawa), Abdellah Sebbar (Ottawa), Gary Walsh (Ottawa), Hugh Williams 
(Calgary/Carleton) and Kenneth Williams (Carleton). The Scientific Advisory 


vi Preface 


Committee consisted of John Friedlander (Toronto), Steve Gonek (Rochester), Eyal 
Goren (McGill), Stephen Kudla (Toronto), Cameron Stewart (Waterloo), Hugh 
Williams (Calgary/Carleton) and Kenneth Williams (Carleton). The Ribenboim 
Prize Committee was made up of Adrian Iovita (Concordia), Ram Murty (Queen’s) 
and Damien Roy (Ottawa). 

The conference was supported in part by Carleton University, University of 
Ottawa, Centre de Recherches Mathématiques, the Tutte Institute, Number Theory 
Foundation, the Fields Institute and the National Science Foundation. 

This volume contains written versions of some of the presentations. All papers 
were refereed. We thank all the authors for their considerable care and effort in 
preparing their manuscripts for these proceedings. Special thanks are due to the 
referees for their important contribution to these proceedings. 

We gratefully acknowledge the faculty, staff and students of the School of 
Mathematics and Statistics at Carleton University for their help and support, 
Debbie Iscoe (Fields Institute) for her help with assembling this volume and Carole 
Williams for her assistance with the conference registration. 


Ottawa, ON, Canada Ayse Alaca 
March 2015 Saban Alaca 
Kenneth S. Williams 
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Identities for Logarithmic Means: A Survey 


Bruce C. Berndt and Sun Kim 


Abstract We provide a survey of identities for sums of the type )©,,-, a(n) log(x/n). 
In each case, a(n) is an arithmetical function generated by a Dirichlet series 
satisfying a functional equation involving the gamma function. Moreover, all of 
the identities given in this paper feature infinite series of Bessel functions. 


1 Introduction 


Let a(n) be an arithmetical function generated by a Dirichlet series satisfying a 
functional equation involving the gamma function J"(s). For example, let 7;(n) 
denote the number of representations of the positive integer n as a sum of k squares. 
Then its generating function 


CO 


oj (s) = So re(nyn™, o =Res> $k, (1) 


n=1 


satisfies the functional equation 
nD (s)te(s) = WP PSK s\&(Gk 5). 


Second, let d(n) denote the number of positive divisors of the positive integer n. If 
€(s) denotes the Riemann zeta function, it is easily seen that 


£7(s) = So dan™, o>l, 


n=1 
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which satisfies the functional equation 
mx T7(4s)0%(s) = a P2(4 — bs)C7( = 9). 


Third, let t(m) denote the Ramanujan tau-function. Then Ramanujan’s Dirichlet 
series 


CO 


f(s) := > t(n)n*, o> B. 


n=1 


satisfies the functional equation 
(2x) *I'(s)f(s) = (20) ©? © F(12 — s)f (12 — 5). 


In each case, the functional equation can be used to analytically continue the 
Dirichlet series to the entire complex s-plane. 

For an arithmetical function a(n), we often desire an asymptotic formula for 
Y<, a(n), or, if we divide by x, we ask for the average order of a(n). If a(n) 
is generated by a Dirichlet series satisfying a functional equation involving I"(s), 
then )”,,-, a(n) often satisfies an identity containing an infinite series of Bessel 
functions [10]. 

For example, 


3 "n(n = 2x+ Yorn) ‘ele J (27 s/nx) 


O0<n<x n=1 


mx + P(x), (2) 


where P(x) is the “error term,’ and where J,,(x) is the ordinary Bessel function of 
order v defined by 


CO 


_ (-1)” Z\v+2n 
Iy(2) = ercarer: i (5) ,  OK<|l<a~, vec @) 


n=0 


The prime / on the summation sign on the left side of (2) indicates that if x is an 
integer, then we count only Ae (x). One of the most famous unsolved problems in 
analytic number theory is the circle problem: find the precise order of magnitude of 
P(x) as x — oo. It is conjectured that, for every € > 0, P(x) = O(x'/4+*) as x tends 
to oo. A history and survey of the circle problem can be found in [7]. 

For a second example, we return to d(n). First define the Bessel function Y, (z) 
of the second kind of order v by [21, p. 64, Eq. (1)] 


J,(z) cos(vz) — J—y(z) 


ee sin(vz) 


: (4) 
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and the modified Bessel function K,,(z) of order v by [21, p. 78, Eq. (6)] 


iv /2 “o\ __ paiv/2 * 

mw e™/2J_ (iz) —e J, (iz) 

K,(2) := : J 
@) 2 sin(vz) ©} 


If v is an integer n, then it is understood that we define the latter two functions by 
taking the limits as v — n in (4) and (5). Let 


LG=—V6= “x, (). (6) 


Then a famous identity of G.F. Voronot [20] asserts that 


y_ d(n) =x(logx + 2y—1)+ ; ++ yd(n) Qc /nx) 


n<x n=1 
1 
=:x(logx+2y—1)+ Z + A(x), (7) 


where y denotes Euler’s constant and A(x) is the “error term.” The Dirichlet divisor 
problem asks for the exact order of magnitude of A(x) as x — oo. Voronoi 
employed (7) to prove that A(x) = O(x!/3 logx) as x —> oo. It is conjectured that 
A(x) = O(x'/4+¢), for each € > 0, as x > 00, and a survey of this difficult classical 
problem can also be found in [7]. 

In many cases an identity may not exist for }°,_,a(n), but it may exist for 
y <, a(n)(x—n)? for sufficiently large p. Moreover, the weighted identity generally 
converges absolutely and uniformly on compact subintervals of (0,00) making it 
more convenient to use than an identity for )>,,-,.a(m), which is discontinuous at 
positive integers n when a(n) 4 0. Then one can apply a method of finite differ- 
ences, originally due to E. Landau, to the sum 7... a(n)(x—n)? to gain information 
about ><, a(n) [11, Theorem 4.1, pp. 106-111]. The sums )>_, a(n)(x — n)? are 
sometimes called Riesz means. Note that for “small” n, the contribution of (x — n)? 
is “large,” while for large n, the contribution of (x — n)? is “small.” Very roughly, 
arithmetic functions are “small” for “small” n and “large” for “large” n, and so 
(x —n)? acts as a “smoothing factor.” In fact, W. Sierpinski [18] used an identity for 
>< 12(n) (x — n) to show that P(x) = O(x'/), as x tends to infinity. 

We emphasize that the logarithmic sums Yo <, a(n) log’ (x/n) can also be used in 
the study of the average order of certain arithmetic functions, since log?(x/n) has a 
“smoothing” effect similar to that of (x—n)?, for in each case, when n is “small,” the 
contributions of these factors are “large,” while when n is “large”, the contributions 
of these factors are “small.” Generally, for “small” p, the simplicities of the identities 
for )°,-,a(n)(x — n)? and >>. a(n) log’(x/n) are comparable, but for “large” p, 
the identities for the former sum are usually more elegant than those for the latter 
sum. It is likely for this reason that Riesz means have been employed instead of 
logarithmic means in the study of the average order of arithmetic functions. 
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The goal of this paper is to provide a survey of identities for )°,_, a(n) log(x/n), 
which, because of their intrinsic beauty, deserve to be better known than they are. 
We are confining our attention only to the case p = 1, because with increasing p, 
the logarithmic mean identities diminish in elegance; see, for example, [3]. 


2 History and Examples 


To the best of our knowledge, the first logarithmic mean identity is due to 
A. Oppenheim [14] in 1927. Let d(n) denote the number of divisors of n. Then 
[14, p. 342], for x > 0, 


1 1 
y d(n) log —_ xlogx—x+ (2y—1)x+ ri logx + . log(2z) 
n 
nsx 
1 Sdn) 
20 8 


[y(4a Jinx), (8) 


where y denotes Euler’s constant, Jo is defined by (6), and the series on the right- 
hand side of (8) converges absolutely and uniformly on compact subsets of (0, 00). 
Oppenheim’s proof is interesting. Let o,(n) = Dual d‘. He derives a general 


identity for >?,,<, ox(n)(x — n)? [14, p. 340], which we relate only for |k| < 5 and 
p = 0. Let 


F(z) := cos(4vz)J, (z) + sin(Svz)1, (2), 


where J,,(z) and /,,(z) are defined in (3) and (6), respectively. Then 


n 


f = / 
Tou) =H) + Dam (2) Fidevm, ——®) 
n=1 


nsx 
where @,(x) is the sum of the residues of 


C@SE—K) 


& 


Using (9), Oppenheim forms an identity for 


L(k) =7 YS o4(n) — S) ox(n) . (10) 


n<x nsx 
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He then takes the limit of his identity for L(k) as k — 0. Oppenheim did not 
provide any details, and we will not as well, except that we will evaluate, by (10) 
and L’Hospital’s rule, 


lim L(k) = lim x logx )> o4(n)—x* S° Yd *logd—Y° Yd‘ logd 


nSx nsx d\n nsx d\n 
=logx ))d(n)—2)°) )logd 
n<x n<x d\n 
a logx ) *d(n) a > ~ logn 
nsx NSXx dln 
= py d(n) log - 
nsx 


because 


Y“logd = dbz = = lognd(n) — Yo logd. 


d\n d\n d\n 


The only other proof of (8) of which we are aware is due to the first author 
[3, p. 371], who deduced (8) from a general identity, established by him for 
<x a(n) log’ (x/n) by a method completely different from that of Oppenheim. 

Oppenheim [14, p. 312] further remarks, “Similarly we can obtain identities for 
...and 


x 
= r2(n) log i 


nsx 


However, Oppenheim provides no details. In 1954, C. Miiller [13] proved that 


r*(1/4) 
oa 


+ O74), (11) 
4n 


x 1 
S > ro(n) log — wx —logx — Z lo 


nsx 


as x —> oo. He did not provide an exact formula for the error term on the right side 
of (11). L. Carlitz [9] gave a simpler proof of (11), but his method did not yield a 
value for the constant term in closed form on the right-hand side of (11). The first 
author [3, p. 372] established an identity for the first time, showing that 


n 


1 lee) 
D-ro(n) log — = mx —logx + (0) — = Jo(2 V/nx), (12) 


n<x n=1 
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where ¢5(s) is defined in (1). The error term in (11) follows readily from the well- 
known asymptotic formula [21, p. 199] 


4 \'2 1 
Jy(x) = (=) cos (s— fon — 4x) +0 (<a). (13) 


as x — oo. R. Ayoub and S. Chowla [2], [12, pp. 1189-1191], unaware of [3], 
established (11), including showing that 


réa/a) 


1 

/ 
=--] 
£(0) = —7 log —— 


D. Redmond [16] generalized the work of Ayoub and Chowla by replacing 
r2(n) by 


dyx(n) = > @y2(n/4), (14) 


d|n 


where x; and 72 are characters modulo q; and q2, respectively. Thus, when 
xi(n) = 1 and x2(n) is the primitive, non-principal character modulo 4, then 
dy, y(n) = ir(n). If x1(2) = x2(n) = 1, then d,, ,,(n) = d(n). Redmond did 
not prove (8) and (12), but instead established results in which the infinite series 
of Bessel functions are replaced by error terms. In these two examples, Redmond 
claimed error terms improving that of Miiller and of Ayoub and Chowla in (11), and 
also that obtained by approximating the Bessel functions in (8) by (13). However, 
in [17], Redmond acknowledged that his claimed error terms are incorrect and that 
his methods only yield error terms that are obtained by using (13). 

C. Calderén and M.J. Zarate [8] generalized Redmond’s work by proving an 
asymptotic formula for }°,-, a(n) log‘ (x/n), where a(n) is an arithmetical function 
(too complicated to state here) considerably generalizing (14). 

U.M.A. Vorhauer [19, p. 60, Theorem 2], apparently unfamiliar with the earlier 
work of Miiller, Carlitz, Ayoub and Chowla, Redmond, and Calderén and Zarate, 
also established (11). Her result was actually a special case of a general theorem 
in which she established an “identity” for }*,,-,.a(n) log*(x/n), where k > 0 and 
a(n) is generated by a Dirichlet series satisfying a functional equation involving a 
very general product of gamma factors. Her theorem [19, p. 59, Theorem 1] is more 
general than the one proved in [3], where the arithmetical functions are generated 
by Dirichlet series satisfying functional equations involving I”’"(s), where m is a 
positive integer. However, Vorhauer’s general “identity” contains an “error term,” 
for which estimates are given, while the identity in [3] is exact and written in terms 
of Bessel functions. The primary purpose of Vorhauer’s paper is to provide precise 
estimates for the “error terms.” 
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Let us return to Ramanujan’s tau-function, one of the three examples discussed 
in the Introduction. From [3, p. 372], we record the identity 


y-r(n) log — = =a 2 > of" MO ih eax (15) 


nsx 


The identity (15) illustrates a common roadblock in establishing elegant logarithmic 
mean identities—the identities often involve integrals that are not readily evaluated 
in closed form, as is the case with the integrals on the right side of (15). 

The present authors and their colleague, A. Zaharescu, have devoted considerable 
efforts in recent years to establishing two intriguing identities found in Ramanujan’s 
lost notebook [15] that are connected, respectively, with the classical, unsolved 
circle and divisor problems. For example, see their paper [6], their survey paper 
[7], and an account of much of their work in the book [1] by Andrews and Berndt. 
We record only one of the two identities. First define 


[x], if x is not an integer, 


= | (16) 


1 . . . 
X—5, ifxis an integer. 


We offer now the first entry. 
Entry 1 (p. 335). Let F(x) be defined by (16). If0 < 6 < 1 andx > 0, then 


De; ) sin(27n0) = mx (5 - 0) - 7 cot(r8) 


co oo (I (4x m+ Ox) I (4x me + 1— Ox) 


"3 p> Jmnt+0)~—~——S mn + 1-0) 


It was natural for us to ask if there exist logarithmic mean identities correspond- 
ing to Ramanujan’s two entries on page 335 of his lost notebook. Indeed, we found 
such analogues. However, to establish these logarithmic analogues, we first need to 
establish analogues of (8) for weighted divisor functions, which we now define. 

For a character y, define the weighted divisor sum 


d,(n) = )> x(d). 


d|n 


If y is a character modulo gq, the Gauss sum t(y) is defined by 


q-1\ 
r= > oe". 


h=1 
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Lastly, for any character y modulo q and o > 1, recall that the Dirichlet L-series is 
defined by 


Co 
L(s, x) = = x)n~. 
n=1 
It is well-known that L(s, 7) can be analytically continued into the entire complex 
s-plane. 
We are now ready to state two new logarithmic mean identities [5]. 


Theorem 2. [f y denotes an odd primitive character modulo q, then 


it(X) 
Qn 


yd, (n) log — = xL(1, x) + — log(2ax) LI, X) — SUC. x) 


nsx 


4 Dye a aa (42 /nx/4). (17) 


n=1 
Theorem 3. /f y is an even, non-principal, primitive character modulo q, then 


dz x) 


1 
AO) log — = xL(1, x) — 5E'O.0) = aed Pe 


nsx 


In(4x /nx/q). (18) 


Theorems 2 and 3 are special cases of theorems in [3], but the identities and the 
details of their proofs were not previously worked out until the authors did so in [5]. 

We are now ready to offer the logarithmic mean identity motivated by Entry 1, 
established with the key aid of Theorem 2, and proved by the authors in [5]. 
(A corresponding entry motivated by Ramanujan’s second entry on page 335 of 
[15] and established with the help of Theorem 3 will not be given here.) 


Theorem 4. Let x > 0 and0 < @ < 1. Then 


oa log . > sin(27ré) 


nsx r|n 
2 
= erty cot(7z0) + mx (5 - 0) + —(n) —yd— )) 
1 Jo (4x Jm(n+ Ox) Jo (4x min +1— Ox) 
4x X (m(n + @)) 7 (m(n + 1 — @)) ’ 
n>0 
(19) 


where y\(@) and y,(1 — @) are the Laurent series coefficients of the Hurwitz 
zeta function €(s,a), also called generalized Stieltjes constants, defined by Berndt 
[4, p. 152] 
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1 Cc 
S(s,a)= — + Y— yn(a)(s = 1)". (20) 

n=0 
Lastly, suppose that we consider the logarithmic mean identity associated with 
the arithmetic function a(n) = 1, which, of course, is generated by ¢(s). Then one 


can show that [3, p. 370] 


a Ps lL, L (on) > ee sinu 
og —- = x— ~logx— = log(27) — — u 
= oa 2 g 2 2 ar 2nnx U 
1 1 = B,(x — [x]) 
~x- 3 loex- 5 log(2z) — x @—Dien’ (21) 


n=2 


as x tends to infinity, upon successive integrations by parts, where B,(x), n > 2, 
denotes the nth Bernoulli polynomial. (Complete details may be found in 
[3, p. 371].) The asymptotic expansion (21) is equivalent to Stirling’s asymptotic 
series expansion for the gamma function. 

We have not recorded all known logarithmic mean identities that can be found 
in the literature. For example, let F(n) denote the number of nonzero integral ideals 
of norm n in either an imaginary quadratic number field or a real quadratic number 
field. Then in each of these cases, identities for }>,,_, F(n) log(x/n) are derived 
in [3]. Redmond [16] also considered the case of an imaginary quadratic number 
field, but with the infinite series of Bessel functions replaced by an error term. 
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Universal Thickening of the Field 
of Real Numbers 


Alain Connes and Caterina Consani 


To the memory of M. Krasner, in recognition of his 
farsightedness. 


Abstract We define the universal thickening of the field of real numbers. This 
construction is performed in three steps which parallel the universal perfection, the 
Witt construction and a completion process. We show that the transposition of the 
perfection process at the real archimedean place is identical to the “dequantization” 
process and yields Viro’s tropical real hyperfield R?. Then, we prove that the 
archimedean Witt construction in the context of hyperfields allows one to recover 
a field from a hyperfield, and we obtain the universal pro-infinitesimal thickening 
Roo of R. Finally, we provide the real analogues of several algebras used in the 
construction of the rings of p-adic periods. We supply the canonical decomposition 
of elements in terms of Teichmiiller lifts, we make the link with the Mikusinski 
field of operational calculus and compute the Gelfand spectrum of the archimedean 
counterparts of the rings of p-adic periods. In the second part of the paper we also 
discuss the complex case and its relation with the theory of oscillatory integrals in 
quantum physics. 


1 Introduction 


This paper establishes an analogue of the construction of the rings of periods 
of p-adic Hodge theory (cf. e.g. [11-13]) when a p-adic field is replaced by the 
field R of real numbers. We show that the original ideas of M. Krasner, which 
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were motivated by the correspondence he first unveiled between Galois theories 
in unequal characteristics [17], reappear unavoidably when the above analogy is 
developed. The interest in pursuing this construction is enhanced by our recent 
discovery of the Arithmetic Site [6] with its structure sheaf of semirings of 
characteristic 1, whose geometric points involve in a crucial manner the tropical 
semifield R4!. The encounter of a structure of characteristic 1 which is deeply 
related to the non-commutative geometric approach to the Riemann Hypothesis has 
motivated our search for the replacement of the p-adic constructions at the real 
archimedean place. 

We recall that the definition of the rings of p-adic periods is based on three 
main steps. The first process (universal perfection) is a functorial construction which 
links a p-perfect field L of characteristic zero (e.g. the field C, of p-adic complex 
numbers) to a perfect field F(L) of characteristic p (cf. Appendix 2 for notations). 

The second step is the p-isotypical Witt construction which defines a functorial 
process lifting back from characteristic p to characteristic zero (cf. Appendix 3). 

Finally, in the third step one defines various rings of periods B, by making 
use of 


— The integer ring Of C F(C,) and other natural rings obtained from it. 
— The canonical covering homomorphism 6 : W(@r) — C, (cf. Appendix 4). 
— Natural norms of p-adic type and corresponding completions. 


These constructions then provide, for each ring of periods, a functor from the 
category of p-adic Galois representations to a category of modules whose definition 
no longer involves the original (absolute) Galois group but trades it for an action of 
the Frobenius ¢, a differential operator etc. There is a very rich literature covering 
all these topics, starting of course with the seminal, afore mentioned papers of J.-M. 
Fontaine; we refer to [1] for a readable introductory overview. 

For the field R of real numbers, Galois theory is of little help since Aut(R) is the 
trivial group and Autg(C) = Z/2Z is too small. However, the point that we want 
to emphasize in this paper is that the transposition of the above three steps is still 
meaningful and yields non-trivial, relevant rings endowed with a canonical one 
parameter group of automorphisms F,, 4 € R%., which replaces the Frobenius ¢. 
More precisely, the analogues of the above three steps are 


1. A dequantization process, from fields to hyperfields. 

2. An extension (W-models) of the Witt construction that lifts back structures from 
hyperfields to fields. 

3. Several completion processes which yield the relevant Banach and Frechet 
algebras. 


We discovered that the two apparently unrelated processes of dequantization on 
one side (cf. [20]) and the direct transposition of the perfection process to the real 
archimedean place on the other, are in fact identical. The perfection process, in the 
case of the local field R, starts by considering the set F(R) made by sequences 
X = (Xn)n>0. Xn € R, which satisfy the condition x* 41 = %n forall n € Z5o. Here, « 
is a fixed positive odd rational number (i.e. |k|2 = 1) such that |K|oo < 1. 
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In Sect. 2 we prove that by applying the same algebraic rules as in the construc- 
tion of the field F(Z) in the p-adic case, one inevitably obtains a hyperfield (in 
the sense of M. Krasner) R’. The hyper-structure on R? is perfect, independent of 
the choice of « and it turns out that R° coincides with the tropical real hyperfield 
introduced by O. Viro in [25] as the dequantization of R. The main relevant feature 
of the hyperfield R? is to be no longer rigid (unlike the field R) and some of its 
properties are summarized as follows 


Theorem. (i) R? is a perfect hyperfield of characteristic one, i.e. x +x = x, Vx € 
R? and for any odd integer n > 0, the map R? > x +> x" is an automorphism 
of R’. 

(ii) Aut(R?) = R., with a canonical one parameter group of automorphisms 0}, 
AER. 

(iii) The map x —> xo defines a bijection of sets R° = R; the inverse image of the 
interval [—1, 1] C R is the maximal compact sub-hyperring C C R’. 


No mathematician will abandon with light heart the familiar algebraic framework 
of rings and fields for the esoteric one of hyperstructures. When Krasner introduced 
hyperfields (and hyperrings) motivated by the correspondence he had unveiled 
between the Galois theories in unequal characteristics [17], the main criticism 
which prevailed was that all the interesting and known examples of hyperfields 
(and hyperrings) are obtained as quotients K/G of a field (or ring) by a subgroup 
G Cc k™ of its multiplicative group, so why not to encode the structure by the 
(classical) pair (K, G) rather than by the hyperfield K/G. The second step (ii) in our 
construction exploits exactly that criticism and turns it into a construction which has 
the additional advantage to parallel the classical p-isotypical Witt construction. 

Given a hyperfield H, a W-model of H is by definition a triple (K, p, tT) where 


— Kisa field 
— p:K — H isa homomorphism of hyperfields 
— t:H — K isa multiplicative section of p. 


The notion of morphism of W-models is straightforward to define. A W-model of 
FH is said to be universal if it is an initial object in the category of W-models of 
H.. When such universal model exists it is unique up to canonical isomorphism and 
we denote it by W(H). In Sect. 3 we show that the universal W-model of R” exists 
and it coincides with the triple which was constructed in [3, 5], by working with 
the tropical semi-field R4* of characteristic one and implementing some concrete 
formulas, involving entropy, which extend the Teichmiiller formula for sums of 
Teichmiiller lifts to the case of characteristic one. We let W = Frac(Q[IR7]) be 
the field of fractions of the group ring of the multiplicative group RY, tw : Ri > 
Q[R%] C W be the canonical group homomorphism and py : W > R? ~ R be the 
map defined by 
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Pw deity 5) / 2 Birw 0) =n (3) a 


where x9 = sup{x;}, yo = sup{y;} and a;, Bj € Q. 
Theorem. The triple (W = Frac(Q|R*]), pw. tw) is the universal W-model for 


H = R’. The homomorphism pw induces an isomorphism of hyperfields W/G = 
R’, where G = Ker(pw : WX > R?*). 


In the p-isotypical Witt construction R ++ W(R), the respective roles of 
(H, K, p, tT) correspond to the initial perfect ring R, the p-isotypical Witt ring W(R), 
the residue homomorphism p : W(R) — R and the Teichmiiller lift r : R > W(R). 
It is important to underline here the fact that while homomorphisms of fields are 
necessarily injective this restriction no longer applies to hyperfields. This is the 
reason why one can work directly with fields in the definition of W-models. The 
subring Wz(H) C W(H) generated by the range of the section t provides then 
a ring theoretic structure, at the real archimedean place, and it generates the field 
W(A). Moreover, the definition of the subring Wz(R) C W(H) is meaningful for 
any sub-object R C H. This construction applies in particular to the maximal 
compact sub-hyperring @ C R? and it provides the starting structure from where 
one develops the construction of the real archimedean analogues of the various rings 
used in p-adic Hodge theory. Finally, the functoriality of the set-up of the universal 
W-models yields, for H = R’, a canonical one parameter group of automorphisms: 


F, = W(6,) € Aut(W(R’)), Ae R% (1) 


which are compatible with (i.e. preserve globally) the various subrings defined 
above. The analogue of the covering map @ is defined, likewise in the p-adic case, 
as the unique ring homomorphism: 


6: Wo(R’) > R, 6(x(x)) = x0, Vx = On)n>o € F(R) = R’. (2) 


Using the map 6 we define the universal formal pro-infinitesimal thickening of R as 
the Ker(0)-adic completion of Wo(R’), i.e. Roo = lim Wo(R?)/Ker(@)”. 

In Sect. 4, we show (cf. Theorem 3) that Roo is more substantial than the ring 
R[[T]] of formal power series with real coefficients. For each non-trivial group 
homomorphism ¢ : R?. — R, we define a surjective ring homomorphism Rj, —> 
R[[7]]. In fact we find that the real vector space Qz = Ker(@)/Ker(6)? is infinite 
dimensional and it is inclusive of the R-linearly independent set of natural periods 
Ip = [p] — p, indexed by prime numbers. Theorem 4 gives the presentation of Qz 
by generators e(x), x € R, and relations ((A), (B), (C)), which coincide with the 
defining relations of the argument of the 1.5 logarithm (cf. [3, 16]) intrinsically 
related to the entropy function. 
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Theorem. The space Ker(@)/Ker(0)? is the infinite dimensional R-vector space 
Q2Qp generated by the symbols ¢(x), x € R, with relations 


(A) : e(1 —x) = e(x) 


(B) :o(4-+») =) + = ye( >) tye -*). Vy € {0,1} 


(C) : xe(1/x) = —e(x), Vx £0. 


The above real archimedean analogue of the p-isotypical Witt construction is purely 
algebraic and the archimedean analogue of the p-adic topology plays a dominant 
role in the third step (iii) of our construction (cf. Sect.5). This process yields R- 
vector spaces and, in direct analogy with the theory of p-adic rings of periods, the 
definition of several Banach and Frechet algebras obtained as completions using 
the direct analogues of the ||.||, norms of the p-adic theory (cf. Appendix 4). In 
Theorem 5 we show that the real archimedean analogue B?* of the ring B+ of 
p-adic Hodge theory (cf. Appendix 4) is the Banach algebra of convolution of finite 
real Borel measures on [0, oo). In Sect. 6 we investigate the ideals and the Gelfand 
spectrum of the Frechet algebras obtained from B’;* by completion with respect 
to the archimedean analogue of the norms ||f||, used in p-adic Hodge theory (cf. 
Appendix 4). In Theorem 8 we show that the Gelfand spectrum Spec (Be) of the 
Frechet algebra BE: o is the one point compactification Y = Ct U {oo} of the open 
half-plane Ct = {z € C | R(z) > O}. It follows that the above algebras can be 
faithfully represented as algebras of holomorphic functions of the complex variable 
z € Y, and moreover 


— The Teichmiiller lift [x] of an element x € [—1, 1] is given by the function z 
sign(x)|x|*. 
— For p > 0 the analogue of the ||.||, norm is given, with a = by 


ee oe 
log p’ 
fll = [ edb). VFO = i edb (€) 


where the function #(&) is of bounded variation. 
— The one parameter group F, acts on C* by scaling z > Az and it fixes 00 € Y. 


In Appendix 5 we explain the relation of the point of view taken in this paper and 
our earlier archimedean Witt construction in the framework of perfect semi-rings of 
characteristic one. It is simply given by the change of variables z = + as explained 
in (177). 

Our analogy with the p-adic case is based on the following canonical decompo- 
sition of the elements of Bet (cf. Sect.5, Theorem 6; the symbol ~ denotes the 
hyperaddition in R’, cf. (8)) 
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Theorem. Let f € Bet Then, there exists a real number sy > —oo and a real 
measurable function s > so +> f; € [—1, 1] \ {0}, unique except on a set of Lebesgue 
measure zero, such that f; ~ f; = fs for s < t and so that f = i. [fsle*ds. 


We use this canonical decomposition as a substitute of the p-adic decomposition 
of every element x € B’*+ in the form x = ooo Mnla”, with x, € Op, Wn 
(we refer again to Appendix 4 for notations). The relation between the asymptotic 
expansion of f(z) for z > oo in powers of T = t and the Taylor expansion at § = 0 
of the function #(&) of the formula f(z) = a e §*d@(E) is given by the Borel 


transform. In the simplest example ¢(€) = te which corresponds to the Euler 
divergent series: 


f@ =f0/T) ~ ey"tT" = oes 


the canonical decomposition is given by the fast convergent integral f = 
Io’ Uile*ds where fe = e'* © [-1,1] \ {0} for all s > 0. By exploiting 
Titchmarsh’s theorem we then show that, in general, the leading term f;, in the 
expansion has a multiplicative behavior in analogy with the p-adic counterpart. This 
part is directly related to the construction of the Mikusinski field: in Proposition 9 
we provide the precise relation by constructing an embedding of the algebra Bt, in 
the Mikusinski field Wt. 

In Sect. 7 we start the development of the complex case, namely when the local 
field R is replaced by the field C of complex numbers. We describe an intriguing 
link between the process of dequantization of C and the oscillatory integrals which 
appear everywhere in physics problems. We illustrate this connection by treating in 
details the case of the Airy function and by showing how the asymptotic expansion 
of this function (already obtained by Stokes in the nineteenth century) involves 
an hypersum in the hyperfield quotient of a field of complex valued functions by 
a subgroup of its multiplicative group. More in general, in the context of gauge 
theories in physics, the presence of several critical points is unavoidable and for 
this reason we expect that the formalism deployed by the theory of hyper-structures 
(hyperrings and hyperfields) might shed some light on the evaluation of Feynman 
integrals in that context. 

Motivated by the Wick rotation in quantum physics, which allows one to trade 
an oscillatory integral for an integral of real exponentials, we study a simple “toy 
model” C° of the dequantization of the field of complex numbers by paralleling 
the various steps explained before for the real case. In particular, we prove that 
C? is the natural perfection of the hyperfield 7C introduced by Viro. The infinite 
dimensional, complex vector space 2c¢ = Ker(6)/Ker(@)? naturally associated to 
the universal, formal pro-infinitesimal thickening C,., of C contains two C-linearly 
independent types of periods. The first set is the natural complexification of the set 
of real periods z,, while the second period ¢ is purely complex and it corresponds 
to 2iz. 
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Appendix | reports a table which describes the archimedean structures that we 
have defined and discussed in this paper and their p-adic counterparts. 

In Appendix 2 we provide a short overview of the well-known construction of 
universal perfection in number theory. 

In Appendix 3 we develop a succinct presentation of the isotypical Witt 
construction R + W/(R) as a prelude to the theory of W-models. The objects of 
the basic category are triples (A, ,t). The algebraic geometric meaning of p¢ is 
clear (0 : A — R is a ring homomorphism) while the algebraic significance of t 
(a multiplicative section of o) only becomes conceptual by using the F,-formalism 
of monoids. 

Finally, in Appendix 4 we shortly review some relevant constructions in p-adic 
Hodge theory which lead to the definition of the rings of p-adic periods. 


2 Perfection in Characteristic One and Dequantization 


In this section we prove that the functor defined by Fontaine ([11], Sect. 2.1) which 
associates to any p-perfect field L a perfect field F(L) of characteristic p has an 
analogue at the real archimedean place. We find that starting with the field R of 
real numbers and taking the limit of the field laws yields unavoidably a hyperfield 
structure (cf. [4, 17], Sect. 2). This construction shows on one side that hyperfields 
appear naturally as limit of fields and it also provides on the other side an ideal 
candidate, namely the tropical real hyperfield R? introduced in [25] (Sect. 7.2), as 
a replacement at the real archimedean place, of Fontaine’s universal perfection 
structure. We refer to Appendix 2 for a short overview of Fontaine’s original 
construction. 
Given a p-perfect field L, one defines a perfect field F(L) of characteristic p 


F = F(L) = {x = @)nzolx™ € L, OTP = x} (3) 
with the two operations (x, y € F) 


(x + yn = im OH FO YP yy = MY. (4) 


Formulas (3) and (4) are sufficiently simple to lend themselves to an immediate 
generalization. 

Let us start with a topological field & and a rational number « and let consider 
the following set 


F= F(é) = {x = (Xn) n>0lXn € 6, (Xn-41)* = Xn} (5) 
with the two operations (x, y € F) 


(x - y)n = fim n+ + ee ile (xy) n = XnyYn- (6) 
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If & is a p-perfect field one has x = p and thus the p-adic (normalized) absolute 
value yields |x|, = 7 < 1. When & = R, one chooses « such that the usual 
archimedean absolute value yields |k|oo < 1. We assume that the 2-adic valuation 
of « is zero (i.e. that the numerator and the denominator of « are odd), so that the 
operation x +> x* is well-defined on R. The following theorem implements the point 
of view of [25] to establish a precise link between the process of “dequantization” 
in idempotent analysis and the universal perfection construction. 


Theorem 1. (1) The map F 3 x — xo € Rdefines a bijection of sets and preserves 
the multiplicative structures. 

(2) The addition defined by (6) is well defined but not associative. 

(3) The addition given by the limit of the graphs in (6) is multivalued, associative 
and defines a hyperfield structure on F which coincides with the real tropical 
hyperfield R? of [25]. 


Proof. (1) By construction the map x ++ x“ is a bijection of R, thus the map F 5 
xt x € Ris a bijection of sets. It also preserves the multiplicative structure, 
due to the definition of the multiplication on F as in the second formula in (6). 

(2) To avoid confusion with the ordinary addition, we denote the addition in F, as 
in the first formula in (6) and expressed in terms of x) € R, by x +’ y. More 
explicitly, it is given by the formula 


—m, ym 


x+/y= lim "+y* "Sk 
m—-> oOo 


and is easy to compute. In fact, it is given by 


x, if |x| > |y| or x = y; 
x+'y= 4 y, if |x] <|ylorx =y; (7) 
0, ify = —x. 


In particular one finds x +’ x = x, Vx € F. The associative law cannot hold 
since for any y € F with |y| < |x| one has 


(yt! x) +o -x = x4!’ -x=0, yt’ 4+’ —-x) =yt+/0=y. 


(3) For each non negative integer m, the graph G,, of the addition conjugated by the 
map x +> x“ " is connected (cf. Fig. 1). When m —> oo these graphs converge, 
as closed subsets of R x R x R (as Fig. 2 shows) to the graph G of the addition 
~ on the hyperfield R’. The obtained hyperaddition of real numbers is the 
following 


Xx, if |x| > |y| orx = y; 
a—y= ry, if || <lylorx=y; (8) 
[-x,x], if y = —x. 
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Fig. 1 Graph of the addition 
in R after conjugation by 
xh x3, ie. of OF + y3)3 


Fig. 2. Graph of the addition 
in R after conjugation by 

x > x*” for n large. It 
converges to the graph of a 
function which is multivalued 
on the line y = —x 


One can see in Fig. 2 how the limit of the graphs of the conjugates of addition 
becomes multivalued on the anti-diagonal y = —x and fills up the interval 
[—x, x]. One checks directly that with this hyperaddition R? is a hyperfield. 

oO 


Notice that replacing the sum (7) by the multivalued one (8) is the only way of 
making the latter one associative without altering the first two lines of (7). Indeed, 
the fact that 0 € x + (—x) implies that for any y with |y| < |x| one has 


yeyt + (Cx) = O44) + x) = 44+ (9). 


Remark I. The second statement of Theorem | shows that there is no “formal” 
proof of associativity when addition is defined by (6) and assuming that the limit 
exists. The third statement of the theorem implies that hyperfields naturally arise 
when one considers limits of field structures on the same topological space, since 
as the proof of the statement (3) shows, the limit of univalent maps giving addition 
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may well fail to be univalent. The abstract reason behind the associativity of the 
hyperlaw given by the limit G of the graphs G,, of the conjugate +, of the addition 
in R is that for any convergent sequence z, — z, z € G(x, y) there exist convergent 
sequences x, —> x and y, — y such that z, = x, +n yn. This is easy to see if |y| < |x| 
or y = x since (with m an odd integer) 


Ae(x" + "Ul" = (L+ Q/xymy ie, 


If y = —x the result also holds since the range of the real map € +> ((1+e)”—1)!/", 
for |e| < - is connected and fills up the interval (—1, 1) when m — oo (m odd). 


Definition 1. (1) A hyperfield H is of characteristic one if x + x = x, Vx € H. 
(2) A hyperfield H of characteristic one is perfect if and only if for any odd integer 
n > 0, the map H 3 x +> x” is an automorphism of H. 


Proposition 1. (i) The real tropical hyperfield R® is perfect and of 
characteristic one. 

(ii) The map A +> 6,, 6,(x) = sign(x)|x|*, Vx € R, defines a group isomorphism 
0: RY > Aut(R’). IfA = , © Q is odd (i.e. both a and b are odd integers) 
one has 0; (x) = x*, Wx ER. 

(iii) The compact subset [—1,1] C R° is the maximal compact sub-hyperring O 


of R°. 
(iv) The hyperfield R? is complete for the distance given by d(x, y) = |xo — yo|, for 
xyerF= R’. 


Proof. (i) follows from the equality 1 + 1 = 1 which holds in R’. The perfection 
follows from (ii). 

(ii) The maps 6, are automorphisms for the multiplicative structure. They also 
preserve the hyperaddition ~ on R’. By construction, they agree with x +> x* 
when A € Q’ is odd. Let a € Aut(R’). Since a is an automorphism of the 
multiplicative group R* = R%. x {£1}, one has a(—1) = —1, and a preserves 
globally RY. The compatibility with the hyperaddition shows that w defines an 
increasing group automorphism of R”., thus it coincides with 0). 

(iii) The compact subset [—1, 1] C R? is stable under multiplication and hyperad- 
dition. For any element x € R? with x ¢ [—1, 1], the integer powers x” form an 
unbounded subset of R’, the maximality property then follows. 

(iv) By Theorem | (1), the map x — xo € R defines a bijection of sets which is an 
isometry for the distance d(x, y). The conclusion follows. 

oO 


We refer to Appendix 2 (Proposition 16) for the p-adic counterpart of the above 
statements. 
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3 The Algebraic Witt Construction for Hyperfields 


In this paper we use the following formulation of the classical p-isotypical Witt 
construction which associates to a perfect ring R of characteristic p the strict p-ring 
W(R) of Witt vectors. We denote by pr : W(R) — R the canonical homomorphism 
and tr : R + W(R) the multiplicative section given by the Teichmiiller lift. Next 
proposition is an immediate corollary of Theorem 1.2.1 of [12]. 


Proposition 2. Let p be a prime number and R a perfect ring of characteristic p. 
The triple (W(R), pr, Tr) is the universal object among triples (A, p,t) where A is a 
(commutative) ring, p : A — Ris a ring homomorphism with multiplicative section 
t:R— Aand the following condition holds 


A= lim A/Ker(p)". (9) 


n 


We refer to Appendix 3 for an elaboration on the nuance, due to the presence of the 
multiplicative lift t in the currently used formulation, with respect to the classical 
notion of universal p-adic thickening. Next, we proceed in a similar manner with 
hyperfields, by suitably transposing the above set-up. 


Definition 2. Let H be a hyperfield. A Witt-model (W-model) of H is a triple 
(K, p,T), where K is a field, p : K — H is a homomorphism of hyperfields, and rt 
is a multiplicative section of p. 


A morphism (Kj, ~1, 7) > (Ko, p2, T2) of W-models of H is a field homomorphism 
a : K, — K> such that the following equations hold 


TM =QOT, pi = prod. (10) 


Definition 3. A W-model for a hyperfield H is universal if there exists a unique 
morphism from this model to any other W-model of H. 


If a universal W-model exists then it is unique up-to unique isomorphism and in that 
case we denote it by (W(H), ox, Ty). 

Next, we study the W-models for the real tropical hyperfield H = R’. As a first 
step we construct a particular W-model for H and then we shall prove that in fact it 
is the universal one. 

Let W = Frac(Q[R% ]) be the field of fractions of the rational group ring Q[R?,] 
of the multiplicative group R%.. We let ty : R% — W be the canonical group 
homomorphism tw (x)(= [x]) and define the map py : W > R’ ~ Rby 


pw ( So aiew(x/ > Brew») ) = sien(S) a 
i j 


yo 
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where xp = sup{x;}, yo = sup{y;} and a;, B; € Q. We extend ty to a multiplicative 
section ty : R? > W of pw by setting tw(0) = 0 and tw(—x) = —tyw(a). It 
is straightforward to verify that (W, pw, tw) is a W-model of R? (the details are 
provided in the next proof). We claim that it is also the universal one, moreover 
it describes the full structure of the hyperfield R’ as the quotient of a field by a 
subgroup of its multiplicative group. 


Theorem 2. The triple (W = Frac(Q[R%]), pw. tw) is the universal W-model for 
H=P, 7 

The homomorphism pw induces an isomorphism of hyperfields W/G —> R’, 
where G = Ker(pw : W* > R’”). 


Proof. First we show that the triple (W = Frac(Q[R*]), pw. tw) is a W-model and 
it also fulfills the second property. The map tw : R? > W,x > tw(x) = [dX] is 
multiplicative by construction and it is immediate to check that pw o tw = id, thus 
Tw = [] is a multiplicative section of the map py defined by (11). To understand 
pw it is useful to consider the field homomorphism ® : W > .@(C), where .@(C) 
is the field of meromorphic functions on C, defined by the formula 


ei UX; 
@ Y tw (xi Y tw(y; = a 12 
( at Tw (xi) / pw) Sai (12) 


Then, keeping in mind the notation of (11), one deduces the following interpretation 
of pw 


Bo yo 


(since for x; < x0, yj < yo one has x <K XG, yj < yg when z + +00). One can thus 
state that 


@(X)(z) ~ (2) (2) when z > -Foo (13) 


Ja € Ry, &(X)(2n + 1) ~ apw(X)"*! whenn — +00. (14) 
This means that one can define pw : W > R’ by the formula 


pw(X) = lim (®(X)(2n + 1))i/OrrD, (15) 


Notice that (15) is well defined because the odd roots are uniquely defined in R. The 
map py is clearly multiplicative, next we show that it induces an isomorphism of 
hyperfields W/G — R’, where the subgroup G is the kernel at the multiplicative 
level, ie. G = ker(pw : W* > R>”). By definition the underlying set of G 
is made by the ratios (}7, a:tw(xi))/(7; BjtwO)) € W, such that: x9 = yo and 
sign(a9)=sign(By). What remains to show is that the hyper-addition in R’ coincides 
with the quotient addition rule x +g y on W/G = R. By definition one has 


x+oy = tpw(X + Y)| pw(X) = x, pw(Y) = y}. (16) 
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We need to consider three cases: 


(a) Assume |x| < |y|. Then for some a,b € Ry one has (X)(2n+1) ~ ax?"*! and 
@(Y)(2n + 1) ~ by*"*! and thus it follows that ®(X + Y)(2n + 1) ~ by?"t!. 
Then one gets pw(X + Y) = y. 

(b) Assume x = y. Then for some a,b € Ry one has ©(X)(2n + 1) ~ ax"T!, 
@(¥)(2n + 1) ~ bx*"*! and O(X + Y)(2n + 1) ~ (a+ b)x*"*!, thus one gets 
pw(X + Y) = x, sincea+b>0. 

(c) Assume y = —x. In this case, we have with a, b € Ry: ®(X)(2n+1) ~ ax*"*!, 
@(Y)(2n + 1) ~ byt! = —bx?"*!, In this case we can only conclude that 
|®(X + Y)(2n + 1)| S c|x|?”*! which gives |pw(X + Y)| < |x|. Moreover, 
by choosing X = atw(x), Y = btw(y) for suitable real a,b, we conclude that 
{x, y} C x +c y. In fact by taking z, |z| < |x|], X = tw(x) + tw(z), Y = tw) = 
—ty/(x), one gets pw(X + Y) = zand hence x+ y is the whole interval between 
x and y. 


This shows that the quotient addition rule on W/G coincides with (8). 

Finally, we show that the triple (W,pw,tw) is the universal W-model 
for H = R’. 

Let (K,p,t) be a W-model for H = R’. First we prove that the field K is of 
characteristic zero. Indeed, one has p(1) = 1 and thus for any positive integer p one 
derives 


P(p) € pl) +--+ + pl) = {hh, 
-_—$, 
p—times 
so that p # 0, Vp. Next, we note that t(—1) = —1 € K. Indeed, since 1 is 


multiplicative one has t(—1)? = t(1) = 1 and t(—1) F 1 since (p 0 t)(—1) = 
—1 + p(1) = 1. By multiplicativity of t we thus get 


t(—x) =-t(x) VWreH=R’. (17) 


We first define the homomorphism @ : W > K on R = Q[R%] C W. We denote an 
element in R as x = ajtw(x;) with a; € Q. We define a : R > K by the formula 


a (x camts)) = So air (xi). (18) 


By applying the property (17) we get a(tw(x)) = t(x), Vx € H = R’. We check 
now that a : R > K is injective. Let R 3 x = )0, a;tw(x;) A 0. Let x9 = max{x;}. 
One then has 


pla(x)) = p (x exts)) € )/ lait (xi). 
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One has p(a) = 1 fora € Q,a > 0 and p(—1) = —1 and hence p(a;t(x;)) = 
€;pT(x;) = €;x; with e; the sign of a;. Thus p(a(x)) € aos €iX; = €oXo ~ O and the 
injectivity is proven. From the injectivity just proven it follows that a : R > K 
defines a field homomorphism a : Frac(R) — K. By construction one has 
a(tw(x)) = T(x) Vx ER. 

It remains to show the second equality of (10). Consider p o a: to show that 
this is equal to py it is enough to prove that they agree on R since both maps are 
multiplicative. Let x = )°;aitw(xi) € R, then one has p(a(x)) € ¥¢; p(ait(xi)) 
and by the above argument one gets p(a(x)) = €ox%0 = pw(x). This shows that the 
morphism @ exists and it is unique because a(tw(x)) is necessarily equal to t(x) so 
that by linearity we know @ on R and hence on Frac(R). Oo 


The following functoriality property will be applied later in the paper. 


Proposition 3. Let H be a hyperfield and assume that the universal W-model W(H) 
for H exists. Then there is a canonical group homomorphism 


W : Aut(H) > Aut(W(H)), ~~ W(0) =a. (19) 


Proof. Let 9 € Aut(H) be an automorphism of H. Let (W(H) = K,p,T) be the 
universal W-model for H. Then, we consider the triple: (K, p’ = 0~!op, t! = r08). 
One sees that p’ is a homomorphism of hyperfields, that t’ is multiplicative and that 
p(t! (x)) = 87! (p(t (O(x)))) = x. Thus (K = W(A), p’, t’) is also a W-model of H. 
Then it follows from universality that there exists a field homomorphisma : K > K 
such that the rules (10) hold, and in particular t o 6 = wo t. From this and the fact 
that a is the identity when @ is the identity one deduces the existence of the group 
homomorphism (19). oO 


When H = R? we derive from the above proposition the existence of a 
one parameter group of automorphisms of W(R”) given by the (images of the) 
6, € Aut(R’) (cf. Proposition 1). These operators form the one parameter group 
of Frobenius automorphisms 


W(0,) = Fy € Aut(Frac(Q[R}])). (20) 
The universal W-model W(H) of a hyperfield H (when it exists) inherits automati- 


cally the refined structure of the field of quotients of a natural ring 


Proposition 4. Let H be a hyperfield with a universal W-model (W(H), p,t). Let 
Wz(H) C W(A) (resp. Wo(H) C W(A)) be the (integral) subring (resp. sub Q- 
algebra) generated by the t(x)’s, x € H. Then one has 


W(H) = Frac(Wz(HA)) = Frac(Wo(A)). (21) 
Proof. Let K = Frac(Wz(H)) C W(A). The map t : H — W(H) has, by 


construction, image in K and using the restriction px of p : W(H) > H to K 
one gets a W-model (K, px, tx) for H. Thus by universality there exists a field 
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homomorphism a : W(H) — K such that w o tx(x) = t(x), Vx € H. Hence 
a is surjective on K (as K is generated by the t(x)’s) and also injective (as field 
homomorphism). Then a : W(H) = K isa field isomorphism. A similar proof 
shows the second equality in (21). Oo 


Example 1. Let H = sign = {0,+1}, be the hyperfield of signs (cf. [4], 
Definition 2.2) then Wz(sign) = Z. 


Example 2. Let H = R’, then Wo(R’) = Q|R*]. 


In p-adic Hodge theory (cf. [8], Sect.5) one defines for a finite extension F of 
Q» with residue field F, and for any (real) valued complete, algebraically closed 
field F of characteristic p extension of an algebraically closed field k|F,, a ring 


homomorphism 6 : We,(Gr) > Cp, OO n> onl”) — ae, xO” (a = me is 
a chosen uniformizer of @;). At the real archimedean place, we have the following 


counterpart 


Proposition 5. There exists a unique ring homomorphism @ : Wo(R°) > R such 
that 


6([x]) = O(c(x)) =x =x, VxeR’. (22) 


Proof. It follows from Proposition 4 that Wo(R’) = Q[R%] and thus the natural 
map R% — R extends by linearity and uniquely to a ring homomorphism ([] = 7) 


f) ( » a) = X ajx; € R. (23) 


i 
oO 


4 Universal Formal Pro-infinitesimal Thickening 
of the Field IR 


Theorem 2 states the existence of a universal algebraic object whose definition is 
independent of the completeness condition (9) of Proposition 2. Given a universal 
object among triples (A, p,t) where A is a (commutative) ring, p : A > Risa 
ring homomorphism with multiplicative section t : R — A, the corresponding data 
obtained by passing to the completion lim A/Ker(p)” is automatically universal 
among the triples which fulfill (9). This suggests to consider the homomorphism 
0: Wo (R’) — R of Proposition 5 and introduce the following 


Definition 4. The universal formal pro-infinitesimal thickening Roo of R is the 
Ker(0)-adic completion of Wo(R’), ie. 


— 1 b n 
Roo = lim Wa(R )/Ker(@)". 


n 
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The following theorem shows that Roo has a richer structure than the ring R[[T]] of 
formal power series with real coefficients. In fact we prove that the real vector space 
Ker(6)/Ker(@)? is infinite dimensional. 


Theorem 3. (i) Let £: R% — R be a group homomorphism, then the map 


(ii) 


(iii) 


F(X) (2) = Yo ayel8OFCO VO) | YX = Yailxi] € WoR’) (24) 


L L 


defines a ring homomorphism &% : Wo(R’) — C®™(R) to the ring of smooth 
real functions and 


0(X) = A(X)(1), VX € Wo(R’). (25) 


The Taylor expansion at z = 1 induces a ring homomorphism 


Roo  Rilz— 1], (26) 


which is surjective if x€(x) + (1 —x)€(1 — x) 4 0 for some x € R?. 
Ker(@)/Ker(@)? is an infinite dimensional real vector space and the “periods” 
1, = [p|—p € Ker(@)/Ker(6)?, for p a prime number, are linearly independent 
over R. 


Proof. (i) For any z € R the map log +(z — 1)£ defines a group homomorphism 


(ii) 


(iii) 


R¥. — Rand the conclusion follows. 

It follows from (i) and (148) that one obtains the ring homomorphism (26) 
since for X € Ker(@)” the function .%(X)(z) vanishes of order > n at z = 1. 
We show, under the assumption of (ii), that % as in (26) is surjective. Let 
x € R with x€(x) + (1 —x)€(1 — x) 4 0. One has s(x) := 1 — [x] — [1 —x] € 
Ker(0); the first derivative of %(s(x))(z) at z = 1 is equal to —(xé(x) + (1 — 
x)€(1 — x)) and it does not vanish. Using the R-linearity which follows from 
Wo(R’)/Ker(@) = R and by implementing the powers s(x)”, one derives the 
surjectivity. 

One has x, := [n] — 7 € Ker(@) for any integer n and hence for any prime 
number n = p. Next we show that these elements are linearly independent in 
Ker(6)/Ker(6)?. The latter is a vector space over Wo(R’)/Ker(9) = R, and 
the multiplication by a real number y € R is provided by the multiplication by 
any s € Wo(R’) such that @(s) = y. In particular we can always choose the 
lift of the form s = a[b] where a € Q and b € R®. Assume now that there is a 
linear relation of the form 


X =) ajlbilzp, € Ker(6) (27) 
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where a; € Q, b; € Ri and p; are distinct primes. Let £ : RY > Rbea 
group homomorphism, then by using (ii) we see that %(X) vanishes at order 
> 2 at z = 1, and thus (Fi A(X) = 0. Using mp, € Ker(@), we derive, 
using (24), 


d 
(=), Fe(aj[bi]xp,) = aibipil (pi) 


so that )°,a;bip:€(p;) = 0. Since the logarithms of prime numbers are 
rationally independent, and since the additive group R is divisible, hence 
injective among abelian groups, one can construct a group homomorphism 
£: Ri} — R such that the values of ¢(p;) are arbitrarily chosen real numbers. 
In view of the fact that the above relation is always valid, we derive that all the 
coefficients a;b;p; must vanish and that the original relation is therefore trivial. 

Oo 


Next, we extend the above construction to obtain linear forms on Ker()/Ker(@)?. 


Lemma 1. (i) Let dé : Wo — R be a Q-linear map such that 


d(fg) = O(f)5(g), Vg € Ker(6). (28) 


Then & vanishes on Ker(6) and it defines an R-linear form on Ker(0) /Ker(6)?. 
(ii) Let :R— Rbe such that y(—x) = —y(x) for all x € Rand 


V(x(y+z)-xW(+z) = WOy)-xVO) FV z)—-xW (2), Vany,zER (29) 


then the following equality defines a Q-linear map by : Wo —> R fulfilling (28) 


5o( Yo atsl) = Dawa), Va €Q 4 ER. (30) 
J 


j 


Proof. (i) Forf,g € Ker(@), it follows from (28) that 6(fg) = 0, thus 6 vanishes 
on Ker(6)*. The action of s € R on g € Ker(@)/Ker(6)? is given by fg for any 
f € Wo with O(f) = s. Thus the R-linearity of the restriction of 6 to Ker(@) 
follows from (28). 

(ii) By construction the map 6 defined by (30) is well defined since w is odd, 
and Q-linear. To check (28) we can assume that f = [x] for some x € R. One 
then has 


5(fg) — O(/)8(g) = D> b\(Way)) —xW0;)), Vg = >> bibl. (31) 
J J 


The map L : R > R given by L(y) = W(xy)—xy (9) is additive by (29) and thus 


28 A. Connes and C. Consani 


obi Gy) — x0))) = D5 LO) = u( > bw) 
j j j 


When g € Ker(9), one derives 0(g) = a bjy; = 0 and thus one obtains (28). 
oO 


The next statements show how the entropy appears naturally to define “periods”. 


Lemma 2. (i) The symbol s(x) := 1 — [x] — [1 — x] defines a map R > Ker(@) C 
Wo(R’) such that 


(a) : s1 —x) = s(x) 

():s(0+ ») =50) +t -y16(7 =) + b( -*) 
a hi 

(c) : [x]sC./x) = —s(x). 


(ii) The R-linear span in Ker(9)/Ker(@)? of the s(x), for x € R generates 
Ker(@)/Ker(@)?. 


Proof. (i) The symbol [x] extends to R by [—x] = —|x]. The equality (a) holds by 
construction. We check (b) ((c) is checked in the same way). One has 


u-s(25) = 0--bi-0-y—ah bs(— 4) = b+ bl b+) 


vs( 2) + os( *) <1 y= [1 -y—a] + b] +9] = +9) 50). 


(ii) The R-linear span in Ker(@)/Ker(@)7 of the s(x) contains all the [x+ y]s(x/(x+ 
y)) for x + y ¥ 0, and hence all the [x + y] — [x] — [y]. Let f € Ker(@), then a 
non-zero integer multiple of f is of the form 


nf =) “kl-> bd, Soy =o» 
j k j k 


and both > [xj] — [}0; xj] and >°, Lye] — [}2; ye] belong to the R-linear span of 
the s(z) in Ker() /Ker(6)?. 
Oo 


Theorem 4. The space Ker(@)/Ker(@)? is the infinite dimensional R-vector space 
2 generated by the symbols (x), x € R, with relations 


(A) : el —x) = e(x) 
(B) ee +y) =e) +(e 


(C) : xe(1/x) = —e(x), Vx £0. 


)tye(—), Vy € {0,1} 
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Proof. The map e(x) > s(x) € Ker(@)/Ker(6) is well defined and surjective by 
Lemma 2. We show that it is injective. Let M be an R-linear form on (2. Next we 
prove that there exists 6 : Wg — R fulfilling (28), such that 


5(s(x)) = M(e(x)), Vx ER. (32) 


The injectivity then follows using R-linearity to get M(Z) = 0 for any Z in the 
kernel. We now prove (32). Let H(x) = M(e(x)), then, as explained in [16] and 
Remark 5.3 of [3], the function (x,y) = (x + y)A(=): (x, —x) = 0, is a two 
cocycle on the additive group of R, with coefficients in R. This symmetric cocycle 
defines an extension in the category of torsion free divisible abelian groups, i.e. of 
Q-vector spaces, and hence is a coboundary, ¢ = by. With w,(y) := w(xy) one has 
by,.(y, z) = (xy, xz) = xb(y, z). This shows that w fulfills (29) and that replacing 
w by s(w —_1) one can assume that w is odd. Then 6, defined in (30) fulfills (32) 
since 5y,(s(x)) = by(x, 1 —x) = H(a). Oo 


Remark 2. All Lebesgue measurable group homomorphisms ¢ : R% — R are of 
the form x +> A log x, and yield the linear form on 2 given by the entropy function. 
In the next Sect. 5 we investigate the ring homomorphism % of (24) given by the 
measurable choice £ = log. 


5 The R-Algebras of Analytic Functions 
and Their Canonical Form 


This section is concerned with the topological step inherent to the construction of 
the archimedean analogue of the rings which in the p-adic case are the analogues 
in mixed characteristics of the ring of rigid analytic functions on the punctured 
unit disk in equal characteristics (cf. Appendix 4). In Sect.3 we have seen that 
the structure of the Q-algebras Wo(@) C Wo(R?) is inclusive of a one parameter 
group of automorphisms F, = W(,) € Aut(Wo(R’)) (cf. (20)) preserving Wo(@), 
and of the ring homomorphism @ : Wo(R’) — R (cf. Proposition 5). Thus the 
following map defines a ring homomorphism from the Witt ring Wo(@) to the ring 
of real valued functions of one (positive) real variable endowed with the pointwise 
operations 


Wo(@) > F(R), x —> x(z) = O(F-(x)), Vz > 0. (33) 
For x € Wo(@), one has x = )°,a;[x;] with x; € (0, 1] (we keep the notation 
[x] = t(x) of Sect. 3). The function x(z) = }>; aix;, z > 0, is bounded by the norm 


\|xllo = 52; lail. After performing the compactification of the balls ||x||o < R of this 
norm for the topology of simple convergence 


Xp > X <=> x,(z) > x(z), Vz > 0, (34) 
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one obtains the Banach algebra (song which is the real archimedean counterpart of 
the p-adic ring B+ (cf. Appendix 4). 

The main result of this section is stated in Theorem 6 that describes the canonical 
expansion of the elements of Bet, The following Sect.5.1 prepares the ground. 
In Sect.5.3 we construct the Frechet R-algebra obtained by completion for the 
analogue of the ||.||, norms and relate it to the Mikusinski field (cf. Sect. 5.4). 


b, 
5.1 The Algebra B — 


We denote by NBV (Normalized and of Bounded Variation) the class of real 
functions @(€) of a real variable € which are of bounded variations and normalized 
i.e. point-wise left continuous and tending to 0 as € — —oo. In fact we shall only 
work with functions that vanish for § < 0, and say that a real valued function @ 
on (0,00) is NBV when its extension by 0 for € < 0 is NBV. Thus, saying that a 
function ¢ of bounded variation on (0, oo) is normalized just means that it is left 
continuous at every point of (0, 00). We refer to [21], Chap. 8. 
We observe that with the notation of (8), for a, b € R, one has 


a~b=ae|a\> |b], and a=bif |a| = |p}. (35) 


Throughout this section we continue to use the notation [] = t of Sect. 3. 


Proposition 6. (i) Let ¢(&) be a real valued, left continuous function of — € 
(0, co) of bounded variation and let V be its total variation. Then, there exists 
a measurable function u ~ x, € [-1,1] \ {0} of u € [0,V) such that 
Xy ~~ Xy = Xy for u < v and 


Vv (oe) 
i [eul(2du = / e do(é), VzE RX. (36) 
0 0 


Moreover the function u +> x, is unique almost everywhere (i.e. except on a set 
of Lebesgue measure zero). 

(ii) Conversely, given V < oo and a measurable function u +> x, € [—1, 1] \ {0} of 
u € [0, V), such that xy ~ xy = x, for u < v, there exists a unique real valued 
left continuous function $(&), € € (0,00) of total variation V such that (36) 
holds. 


Proof. (i) Let be the unique real Borel measure on [0, 00) such that: w([0, &)) = 
(&), VE > 0. Then the definition of the integral on the right hand side of (36) is 


/ i. e §do(é) = i i edu. 
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(ii) 


The total variation function Ty(€) which is defined as (with #(0) = 0) 


Ty(€) = sup | )>19(&) -—OE-DI]. O= &9 <8 <...<& =€ 


j=l 


is equal to |,2|([O, &)) where |,z| is the positive Borel measure on [0, 00) which 
is the total variation of jz (cf. [21], Theorem 8.14). We set, for u € [0, V) 


S(u) = inf{E € (0, 00) | Tg(€) > u} € [0, 00). (37) 
One has S(u) > S(v), when u > v. Moreover the function S of u is right 
continuous and is finite since the total variation of ¢ is V = V(¢) = an Tg (&). 
00 
Let m be the Lebesgue measure on (0, V(#)). The direct image of m by S is 
equal to the measure ||. Indeed, one has 


Ty() > u <> Su) <é 


which shows that the Lebesgue measure S(m)((0, &)) of the set {u | S(u) < &}is 
equal to T(E) = |u| ((0, €)), VE. Let A(E) be the essentially unique measurable 
function with values in {+1} such that w = h|y| (cf. [21], Theorem 6.14). We 
define the function u b> x, by 


X, = h(S(u))e™ , Vu € [0, V). (38) 
One has |x,| > |xy| for u < v, moreover |x,| = |xy] = > x, = Xy, since 
|Xu| = |x| implies S(u) = S(v). Moreover, since S(m) = |u| one gets 


Vv foe) 
/ F(S(u))dm = / flaw 
0 0 


and taking f(€) = h(€)e~* one obtains (36). The proof of the uniqueness is 
postponed after the proof of (ii). 

Let o(u) = sign(x,), S(u) = —log(|x,|). These functions are well defined 
on the interval J = [0,V). One has S(u) > S(v), when u > v. Let T(&) be 
defined by 


T(E) = sup{u € J | S(u) < &} € [0, V]. (39) 


T(&) is non-decreasing and left continuous by construction, and thus it belongs 
to the class NBV. By hypothesis |x,,| = |x,| = > 2%. = xy, so that the function 
o(u) = sign(x,) only depends upon S(u) and can be written as h(S(u)) where h 
is measurable and takes values in {+1}. We extend h to a measurable function 
h: (0,00) > {+1}. Let x = hdT be the real Borel measure such that 
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[H|((0,€)) = T(E), w=hlyl. (40) 


Then the function ¢(€) = ([0,&)) is real valued, NBV and such that (36) 
holds. Indeed, one has 


T(E) > u => Sy(u) <&, S4(u) = lim S(u + €) 


e>0 


so that the map S associated to the function ¢ in the proof of (i) is equal to 
S+ and thus it agrees with S outside a countable set. Hence the function x, 
associated to @ by (38) agrees with the original x, almost everywhere and one 
gets (36). 
Finally, we prove the uniqueness statement of (i). It is enough to show that the 
function f(z) given by 


fe) = [toa (41) 


uniquely determines x, almost everywhere. It follows from the above discussion 
that it is enough to prove that f(z) uniquely determines the function @(&). By [21], 
Theorem 8.14, the function ¢ € NBV is uniquely determined by the associated 
measure ju. The latter is uniquely determined by f since one has 


Co 
f= Faue, 
0 
Hence f is the Laplace transform of jz and this property determines uniquely the 
finite measure jL. oO 


The following definition introduces the real archimedean counterpart of the ring 
B?+ of p-adic Hodge theory. 


Definition 5. We denote by B’.;* the space of real functions of the form i 7 [x,]du 
where V < oo, andu + x, € [—1, 1] is a measurable function of u € [0, V], such 
that 


Xy— Xy = Xu, for u < vz. 


Then Proposition 6 shows that the functions in Bet are exactly the Laplace 
transforms of finite real Borel measures 


f= [edu (42) 


Moreover, when expressed in terms of jz one gets 


IIfllo -= suptu € [0, V] | xu FO} = |H|([0, oo) (43) 
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This can be seen using (39) to get 


suptu € [0, V] | # 0} = supfu € (0, V] | S(u) < oo} = lim T(E) = |1|((0.00)). 


This shows that Bet is the real Banach algebra of convolution of finite real Borel 
measures on [0, oo). Then we obtain the following result 


Theorem 5. The space B®;*, endowed with the pointwise operations of functions 
and the map f +> |lfllo = sup{u € Rso | x, 4 0}, is a real Banach algebra. 


; b, 
5.2. Canonical Form of Elements of Eo 


In the p-adic case (cf. Appendix 4), every element x € B?* can be written uniquely 
in the form 


y= > [Xp ]20” > XE Or. (44) 
n>—oo 


In the archimedean case one gets an analogous decomposition by applying Proposi- 
tion 6. In the next pages we shall explain this point with care since the decomposition 
of the elements in Bet does not arise by applying (36) naively. 


Theorem 6. Let f ¢ B+. Then there exists 5) > —oo and a measurable function, 
unique except on a set of Lebesgue measure zero, s +> f; € [—-1, 1] \ {0}, for s > so 
such that f; ~ f; = f; for s < t and 


‘— J tiaeras, (45) 


Proof. By Proposition 6 there exists a measurable function u +> x, € [—1, 1] \ {0} 
of u € [0, V) such that x, ~ xy = x, for u < v so that 


Vv 
—_ i: [xuldu. (46) 
0 
Define f; by the equality 
ts = X(V—e-) 5 Vs> so = — log V. (47) 


The function V — e is increasing and d(V — e *) = e “ds, so that (45) follows 
from (46) by applying a change of variables. Oo 
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The scalars R, 1.e. the constant functions in a, are characterized by the condition 
lo.) 
1 fle “ds eR = > f, € sign, Vs (48) 
50 


where sign = {—1,0, 1} is the hyperfield of signs (cf. [4]). This corresponds, in the 
p-adic case, to the characterization of the elements of the local field K C B’* by 
the condition 


» [an]a”" € K — >a, €k, Vn 
n>>—0o 


where k is the residue field of K (cf. [9], Sect. 2.1 and also Appendix 4). 

In the p-adic case, the projection OG; — kp (cf. Appendix 4) induces an 
augmentation map ¢ obtained by applying the above projection to each a, inside the 
expansion f = ons o6[an]” (cf. (166) in Appendix 4). In the real archimedean 
case, the corresponding projection is the map 


0 if x € (-1,1) 


Rd [-1,1=0 > si i= 
D [-1,]] > sign, xh x tea 


When this projection is applied inside the expansion f = i i [fsle *ds of elements 
in B°.;*, it yields the following 


Proposition 7. For f € B+, let f = ty [f:]e *ds be its canonical form. Then 


ets [file tas = tim Fe (49) 


defines a character € : Re — Rof the Banach algebra BR, 


Proof, When z > 00, one has for any s > 50, [f;](z) > [f](z). Thus it follows from 
the Lebesgue dominated convergence theorem that (cf. (41)) 


im f@ = / * [fleas (50) 


Since the operations in the Banach algebra Bet are pointwise when the elements 
are viewed as functions of z, the functional € is a character. oO 


Next, we exploit a theorem of Titchmarsh to show that the leading term of the 
canonical form (45) behaves multiplicatively likewise its p-adic counterpart. In 
particular, it will also follow that the ring Bet is integral (i.e. it has no zero divisors). 
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Theorem 7. The following formula defines a multiplicative map from the subset of 
non zero elements of B?.;* to (0, 1] 


(A) = im, Ufabel Vf = fo Uileds € Bist \ (0) 651) 


Proof. Using (47), we can write the map |p|, with the notations of Proposition 6 as 


4 
= lim |x|, Vx= | buddu. 
l(a) = tim bel Vx = foal 
By (37) one has 


lim |x| = eS, S(0) = inf{E | Ty (€) > 0} € [0, 00). 


e>0+ 


In terms of the measure 4 = dd, S(O) is the lower bound of the support of uw. By 
Titchmarsh’s Theorem [24] (formulated in terms of distributions [19]) one has the 
additivity of these lower bounds for the convolution of two measures on [0, oo) 


inf Support(j41 * (42) = inf Support(j2) + inf Support(j2) 


and hence the required multiplicativity. Oo 


Remark 3. One important nuance between the p-adic case and the archimedean case 
is in the behavior of so under the algebraic operations. As in the p-adic case the 
quantity V = e~ defines a norm, ||.||o, but this norm is no longer ultrametric and 
is sub-multiplicative (cf. Lemma 4) while its p-adic counterpart (cf. Lemma 3(11)) is 
multiplicative. It remains multiplicative for positive measures. 


5.3 The Real Archimedean Norms |\.||p 


We recall that in p-adic Hodge theory one defines, for each p € (0,1), a 
multiplicative norm on the ring 


1 
BP* = Wo (OP) = fF = D1 [anla" € Erxlan € Op, Vn} (52) 
n>—0o 


(cf. Appendix 4 for notations) by letting 


[flo = max |ay|p". (53) 
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To define the real archimedean counterpart of the norm |.|, one needs first to 


rewrite (53) in a slightly different manner without changing the uniform structure 
that it describes. Let q be the cardinality of the field of constants k so that || = q7!. 
Rather than varying p € (0,1) we introduce a real positive parameter a > 0 and 


make it varying so that p* = q_!. 


Lemma 3. (i) For p € (0,1), we seta = 102 (0 = q'/*). Then, for f = 


Tog p’ 
a [an |” € B’* the equality 
n>—oo 


a _ al —n. 4 
ifs = max |anl%q (54) 


defines a multiplicative norm on B’* that describes the same uniform structure 


as the norm |.|p and whose restriction to Q, is independent of p. 
(ii) The limit, as p — 0, of |f|® is the norm |f\o = q", where r is the smallest 
integer such that a, # 0 (cf. [9], Sect. 3.1). 


Proof. (i) One has p* = q™', thus the second equality of (54) holds. Since | - |p is 
a multiplicative norm the same statement holds for | - le When restricted to Q, 
the expression (54) is independent of a since one has |a,| € {0, 1}Vn. 

(ii) As p > 0, also a — 0 and |f|¢ os 0]—max gq" = Ifo. Oo 

néZ 
an#0 

In the real archimedean case, the operation of taking the “max” in (54) is replaced 

by an integration process. By re-scaling, we can replace logq by 1; then the 

archimedean analogue of (54) is given, for each p € (0,1) and fora = 7: 

by the formula 


Ufle:= f fends , wa f [Gle*ds € Bet, (55) 
AY) SO 


Lemma 4. Let p € [0, 1). Equation (55) defines a sub-multiplicative norm on Be. 


For p > O one has, witha = ES 
log p 


Ill = i; edule), YQ = / efdu(E). (56) 


For p = 0 this norm coincides with the norm ||f|\o of Theorem 5. 


Proof. Using (47), we can write the functional ||/||, with the notations of Proposi- 
tion 6 


Vv Vv 
\xllp =] [xy|%du, Wx =i [xy ]du. (57) 
0 0 
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Using the notations of the proof of Proposition 6, one has 


Vv Vv love) 
/ beul@du = i eS dy = i; eldu(e)| 
0 0 0 


since the image of the Lebesgue measure m on [0, V) by the map S is the measure 
|dz(€)|. Thus we obtain (56). The sum of the functions associated with the measures 
du; G = 1,2) corresponds to the measure dj, + djt2. Thus one derives the triangle 
inequality ||f, + fallp < Ilfillo + lLAll. The product of the functions corresponds to 
the convolution diz = dj, * diz of the measures 


/ n(é)du(é) = / / h(E + £)dyer (E1)dia(s) (58) 
0 0 0 


which is the projection of the product measure diz; ® diz by the map (&, &) te 
5((E1, &)) = &+&. The module of the product measure diz) @d [12 is |djt1|@|dpy]. 
Moreover, for h > 0 a real positive function one has 


CO [oe 
7 h\dv| = sup / hwdv 
0 0 


It follows that the module of the projection of a measure is less than or equal to the 
projection of its module. Thus we derive 


: |W] S 1}. 


fifallo = / ef/8Pldu(E)| < / eli F82)/108 P| dy (E))||dju2(E2)| = llfillollllo 


which proves that ||.||, is sub-multiplicative. The limit case p = 0 arises by taking 
the limit ~@ — 0 in (55), hence we obtain 


Vv 
Ifllo = / du, V = sup{u | te #0}. (59) 


Thus ||f||o agrees with the norm of Theorem 5. Oo 


The norms ||.||,) behave coherently with the action of the automorphisms F,, more 
precisely one has the equality 


Fa Ilo = Ilfllpva- (60) 


Indeed, for f(z) = [>° e**dy(&) one has 


FA) = i edule) = } eFdlE/A) (61) 
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which gives (60). By construction the archimedean norms ||.||, fulfill the inequality 


lifllo < fll. Vo = o’. (62) 


In particular for J C (0, 1) a closed interval, one has, with p9 = min/ the smallest 
element of J 


[Lfll: = sup [lFllo = IF lloo- (63) 
pel 


Definition 6. We define BX, to be the Frechet algebra projective limit of the Banach 
algebras completion of B2,,* for the norms || - ||,, for p € (0, 1). 


It is straightforward to check, using (56), that BE, is the convolution algebra of 
measures 4 on [0, oo) such that 


; e “= ldu(é)| <0o, Va>0. (64) 
0 


Proposition 8. (i) The measures which are absolutely continuous with respect to 
the Lebesgue measure form an ideal J C Bt. 
(ii) Let Be. C BE be the sub-ring obtained by adjoining the unit: 


+ + 
Boog =J+RCBQ. 
Then for f € BX, one has f € aan if and only if the map 4 +> Fy (f) € BE, is 


continuous for the Frechet topology of Bd. 


Proof. (i) Follows from the well known properties of convolution of measures. 
(ii) For f € Br the associated measure ju is of the form ado + hd& where h is 
locally integrable and fulfills 


i e “8 Ih(é)|dE < 00, Va > 0. (65) 
0 


The measure associated to F,(f) is a59 + hydé where hy (€) = +h(E/A) by (61) 
and one has 


IF.) —Fv lle = / olin (€) — hy (Ela 


It follows that the map A +> F,(f) € Bd is continuous for the Frechet 
topology of BX. The converse is proven using [ h,(A)F,(f)dA — f for suitable 
functions h,. 

oO 
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5.4 Embedding in the Mikusinski Field % 


In operational calculus one introduces the Mikusinski ring -@(R+) whose elements 
are functions on R; with locally integrable derivative, and where the product law is 
the Duhamel product (cf. [14]): 


Fx G(t) = £ [ rwoe- u)du. (66) 
0 


This ring plays a main role in analysis, in view of some of its interesting properties 
among which we recall that .#7(R +) is an integral ring by the Titchmarsh’s Theorem 
(cf. [14]) and hence it has an associated field of fractions IN called the Mikusinski 
field. 

The following proposition states the existence of a direct relation between the 
Frechet algebra BE, (cf. Definition 6) and the Mikusinski field 9. For f € Br 


co,0? 
f(2) = fo” e du (€), we let 
m(f)(E) = (0, €]), VE = 0. (67) 


We follow the notation of op.cit. and denote by J the function /(€) = & viewed as 
an element of .@(R+). 


Proposition 9. (i) The map aa > f & m(f) defines an isomorphism of Bx, 0 
with a sub-ring of M(R+). 

(ii) There exists a unique function t € Big such that m(t) = I, and one has 
(Zz) = i Vz>0. 

(iii) The isomorphism m, as in (i), extends uniquely to an injective homomorphism 
of the Frechet algebra Bt, into the field M. 


Proof. (i) It is easy to check that the product (66) gives the primitive of the 
convolution product of the derivatives of F and G. For f ¢€ Bey the 
associated measure jz is of the form F(0)59 + dF where dF = F’dé and F" 
is locally integrable. Thus the convolution of the measures jz corresponds to 
the Duhamel product (66), in terms of m(f). Hence, the map f + m(f) is an 
algebra homomorphism and it is injective by construction. 


(ii) The Lebesgue measure dé fulfills (64), and one has 


a 1 
/ e §d&é =—, Vz>0 (68) 
0 Zz 


thus dé defines an element i € By such that m(v) = J. 
(iii) We first prove the following implication 


feBe = > i-fe Bry. (69) 
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Let f € BL, f(z) = fo? e &du(&) with f° e~* |du|(€) < co Va > 0. Let 
W(u) = fo du(&). Then one has 


where the interchange of integration is justified by Fubini’s theorem. It follows 
that 


(-N@ = i. oy (w)du 


and, since y(u) is locally integrable, 1 -f € Bee: Next one defines 


em, Vf e Bh. (70) 


Since m(.) = / this is the unique extension of m as a homomorphism from BX, 
to Wt. It is well defined and yields the required injective homomorphism. 
Oo 


6 Ideals and Spectra of the Algebras Bt, and Be ; 


We denote by BE = BY @r C and Be o= nae @g C the complexified algebras 


of the rings Bt, and Bes (cf. Sect. 5). In this section we investigate their ideals and 
Gelfand spectrum. 


6.1 The Principal Ideals Ker 6, 


In this section we show that for z) € C, with 3t(z)) > O the kernel of the evaluation 
map f +> f(z) defines a principal ideal of the algebras Be and BE 0 

We begin by stating the following lemma which allows one to divide f by the 
polynomial z — z, when f (zo) = 0. 


Lemma 5. Any f € BE extends uniquely to an holomorphic function z +> f(z) of z 
R(z) > 0. 
Let z € C with R(z) > 0. There exists a function f € Bia such that 


f — 9 (f) = (20 — Zt (71) 
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Proof. Since f € Bt, there exists a complex Radon measure jz on Ry such that 


f(u) = ix edule), Vu > 0, i. e ldu(—)|<0o, Va>0. (72) 
0 0 
The integral 
0.5 (f) = i e du(E) (73) 


is finite and bounded in absolute value by the norm ||f||,, for o = e7!/"), Let 
z # 2, then one has 


— _ 9-208 § 
e e* — i eo E20) 20 dy (74) 
x0 -% 0 


and when z — zo both sides of the above equality converge to the function £e~“°*. 
The equality 


Vu) = iy © otet) du(é), Wwe Ry (75) 


u 


defines a complex valued function whose size is controlled by 
Co 
wal s [MMMM [duQy) (76) 


Next, we show that />° e~*"|w(u)|du < 00, fora > 0. Whena > Oanda < (zo), 
by implementing (74) (for z = @ and with (zo) instead of zo) and Fubini’s theorem 
to interchange the integrals, one has 


[oe lo, CO 
fo ceeoiaus [Of mete au cei 
0 0 u 
Cc 
= (R(zo) = a)! / (eo = e *Go)E) |du(&)|. 
0 
This proves that the formula 
[o,) 
{(z) = / e“w(u)du (77) 
0 


defines an element f € Be o Whose norm satisfies, for py) = e'/ Ro) 


Co Co 
Ito fof ervey) 
0 u 


= (lf lo — Mlloo)/C/ log(e) — 1/log(po)). 
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Moreover, using again (74) with z ~ zo, one obtains 


a Oo 2 _ 20 oo § 
f() F (2) _ [ e e du(é) = : [ oe 20) 208 dy du(§) 


Z0 — Z 20 — Z 


which gives 


A oe Es 
i —fO=fe) a 
£0 —Z 
and the equality (71) follows. oO 
Proposition 10. (i) Let z) € C with R(z) > 0. Then 
B24 : BE = C, Oey (f) = f (Zo) 
defines a complex character of the algebra Be: One has 6; = 0. 
(ii) The ideal Ker(@,,) C Be is generated by the function t — z)', with 
Co 
i(z) = e de =z 1, ¥z>0. (79) 
0 


(iii) t—ale€ Bea and t— z)' generates the ideal Ker(6z,) Bea Cc Bea 


Proof. (i) follows from the first statement of Lemma 5. 
(ii) Since one knows that 


[oe) 1 oo 
i e de = —, / e/ 8 dE < 00, Vp € (0, 1) 
6 Z 0 


one derives that 1 € BE o: Let f € Bz, then by applying Lemma 5, one sees 
that there exists a function f € Be o such that (71) holds. One then obtains 


1 
flo) = feo) + (2 =Jh(2), b= ~zolf —flen)) + 3 (80) 


since 


a 


a 
al a Zo — 2 


(iii) By assuming that f € Ker(6.,) we obtain the factorization f = («—zp')h. (iii) 
then follows since t € BE 0- 
oO 
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Lemma 6. Let a : Ba — C bearing homomorphism. Then, if To = a(t) # 0 one 
has (To) > 0 and 


a= 6, 2 =1/Tp. (81) 
Similarly, the maps 6-, : Bag => C with R(z) > 0, define all the characters a of 
ie with a(t) # 0. 
Proof. For any 4 € C with #(A) = 0, A 0, one has 
T Co 
= -§/T 5 gy 
ao i ele &. 
Thus there exists a function hy € BE g Such that (Az + 1)h, = ct. Then one obtains 
a(Ait+ l)a(hy) = a(t) 4 0 


and Aw(t) + 1 4 0 which shows that i(a@(u)) > 0. It follows from Lemma 5 that 
the map 0,,, z) = 1/Tp defines a character of Be. Moreover for any f € BE there 
exists h € B¢ such that (80) holds i.e. 


f =f) + = To)h. 


One then obtains a(f) = f(z) since w(t — To) = 0. oO 


6.2 Gelfand Spectrum of Be , 


We are now ready to compute the Gelfand spectrum of the Frechet algebra Be 0 


Theorem 8. The Gelfand spectrum Spec (BE o) is the one point compactification 
Y = Ct U {oo} of the open half-plane Ct = {z € C | R(z) > O}. 
The one parameter group F, acts on C* by scaling z > Az and it fixes oo € Y. 


Proof. Let a € Spec Be 9 be a continuous homomorphism @ : Be. 9 > C. Let us 
first assume that Ty) = a() # 0. Then by Lemma 6 one has a = 6,,, z) = 1/To. 

Assume now that a(/) = 0. Then, for any smooth function k(€) with compact 
support one has 


l * eEK df = k(O) +2 : aay 
0 0 


This shows that if k(0) = 0 the associated element of Re. belongs to the ideal 
generated by v. Thus this ideal is dense (for the norm |[/||p, cf. Sect.5.3) in the 
kernel of the character 
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Ooo : Bea >C, O0(f) = jim f(2). (82) 


Thus by continuity we get a = Oo, if a(1) = 0. This shows that Spec (BE o) is the 
space of characters 0, : Bee — C, for z € Y = Ct U {oo}. The action of F, is such 
that 


O.(Fi(f)) = Of), Vf € Bry, AER}, ze. (83) 
oO 


Corollary 1. The map Y > z +> Ker(@,) C Ben defines a bijection of Y with the 
space of maximal closed ideals of the Frechet algebra Bio 


Proof. This follows from the generalized Gelfand-Mazur theorem which shows that 
for any closed maximal ideal J C Bap there exists a continuous character Bey >C 
whose kernel is J. oO 


7 The Complex Case and Oscillatory Integrals 


In the real case it was simple to evaluate the asymptotic behavior of integrals 
of real exponentials as in Proposition 7. On the other hand, in the complex case 
we shall see that oscillatory integrals with several critical points provide typical 
examples of application of the (multi-valued) law of addition in hyperfields. Rather 
than developing the general case we focus on a well-known example of asymptotic 
behavior of integrals of imaginary exponentials, namely the case of the Airy 
function (see [2, 7, 23]). This function is defined by the formula (Fig. 3) 


Ai(x) = x / * E+) as, (84) 


(oe) 


Fig. 3. Graph of the Airy 
function 


fib} ti | 4 
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The integral makes sense in the complex domain along a path slightly above the real 
axis, i.e. of the form C = [—oo + ie, co + ie] with « > 0. This function fulfills the 
differential equation 


y"-z=0 (85) 
and it is entire and given by the series 
gin 3n+1 


z 


= 2/3 3-4/3 
ee a 4 9nIP(n + 2/3) 3 “ 9nlP(n + 4/3) 


1 Zz Zz zt 


_ 2 4 = F (86) 
VAG) 38rG) 6x PArG) 12(38FG)) 


We first consider the asymptotic expansion of Ai(x) at infinity, on the positive 
real axis: 


gat 


1 cp Sena 


Ai(Z) ~ (8/4 (87) 


1 
4732 ° 


The series on the right hand side is not convergent and the strong meaning of the 
expansion is that the ratio of the left hand side by the truncated right hand side is 
“under control” i.e. itis of the form 1 + O(z-”), with m depending on the truncation. 
For instance, the ratio of Ai(z) and the approximation 


T1/4 5T7/4 385713/4 
- + 
2/x Jax | 9216./m 


850857!9/4 " oe) 
1327104 f/x 254803968 /x 


hs(T) = e3(t "( 


is of the form 1 + O(T'*/2) since the next term in the expansion is 


5391411025 T3!/4 


- ~ —0.248702 T3!/4 
12230590464 / 


2 
positive and one lets T — 0+. For each a > O we have a natural subgroup Gy 


of the multiplicative group of non-zero functions defined by the condition 


, . pl : : Basta. 
while the first term is re The parameter T in this approximation is real and 


= {h| A(T) = 1+ O(T*) for T > 04}. (88) 


With this notation we can rewrite the above equivalence of the functions Ai(>) and 
hs(T) as 
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Ai (=) /hs(T) € Gy, & = 7 


It is then natural to ask what kind of algebraic object one obtains if one considers the 
quotient of a field K of functions by the above equivalence relation (for fixed value 
of a). By construction Gy is a subgroup of the multiplicative group K* and thus 
the quotient K/G, is a hyperfield. This implies in particular that having strong 
expansions for two functions does not uniquely determine a strong asymptotic 
expansion for their sum. We illustrate this conclusion by considering the expansion 
of the Airy function on the negative real axis. There, the function admits zeros and 
the expansion is more involved and usually written in the form 


Ai(x) ~ <—s caro (G 4 2x ps Pat % cual 8) 3/4)" — 3 
8m, BARR) APOE DOD ee qe-dity oy B 
—sin ( 4 + 3 p2 7 6 (3/4) e 1)! DP —x) *). 


(89) 


In this case we cannot expect that the ratio of the left hand side with a truncation 
of the nea hand side belongs to Gy, for some a > 0 (after changing variables to 
i= —+) since the equivalence relation preserves the zeros except for finitely many 
(since fo a > Oandh € G, one has A(T) = 0 for only finitely many T > O ina 
neighborhood of T = 0). In fact, what the above asymptotic expansion suggests is 
that one can decompose the function Ai(—+) as a sum of two functions which are 


equivalent, in the above strong sense, respectively to (with x = —t) 


=) 3 T(n+ = 8) 3/4)" = 


1 -1/4 uA 
Ca 


neven 


and 


(—x)7'/4sin (5 + zi <) 


1 
2732 3 


3/4)" PP (-a)-?, 


(n+ )\Frn+4 =) 3n 
> a 6 


nodd 


To obtain the required decomposition of the function Ai(x) one uses its definition 
as an oscillatory integral (84), i.e. as an integral along a path slightly above the real 
axis, i.e. of the form C = [—oo + ie, oo + ie] with « > 0. In order to obtain the 
decomposition for x = —+ real and negative, one deforms the path of integration 
C in the complex domain to the disjoint union of two paths C; and C2 as shown 
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Fig. 4 For z real negative 
one deforms the path 

C = [—00, oo] to the disjoint 
union of C; and C. The 
levels are those of the real 
part of the function ®(s, z) 


where ®(s, z) = i(§ + 2) 


in Fig. 4. The integrals over the paths C; give complex conjugate numbers and this 
splitting as a sum f, = fo, + fo, gives Ai(x) = 2N(/,,). In fact, the imaginary 
part 23( to) gives the other Airy function Bi(x) which is known to be a solution 
of the second order linear differential equation y’ — xy = 0. This function can be 
defined directly as the following oscillatory integral 


(ove) 3 3 
Bi(x) = - / (G9) + sin (5 n ws)) ds (90) 


described by the two pieces of a path C, going through the lower half of the 
imaginary axis and the right half of the real axis. Bi(x) is characterized, among 
the solutions of y” — xy = 0, by 


31/6 
Bi/(0) = —— 


Bi(0) = ro 
3 


ar) | 


To obtain the required decomposition of Ai(x) one uses the stationary phase method 
to evaluate f, c, Where C2 goes through the critical point ./—x with an angle of 7/4 
with respect to the real axis, so that it follows the line of steepest descent. This 
shows that the argument of the complex number /, Oo= 5 (Ai(x) + iBi(x)) is close 


toa(x) = 7+ a and thus one introduces the rotation matrix 


2x ./—x 
3 


cosa@(x) sina(x) 
—sin a(x) cos a(x) 


Ro) =| ]. «=§+ 
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which one applies to the column vector &(x) with entries (Ai(x), Bi(x)). By using 
the inverse rotation matrix it follows that 


Ai(x) = cos a(x) (RODE (a); — sina(x) (ROE())2 = Aio(a) + Air) 
It is exactly the decomposition of Ai(x) as a sum of two terms Ai;(x) which gives the 


precise meaning to the asymptotic expansion. Indeed, the stationary phase method 
shows that 


6° (3/4)'x- 2. (91) 


3 ae ay eo (int ay (n+ i) 
e A ~ 
C. 21 3/2 n=0 n! 


Thus since e~) (Ai(x) + iBi(x)) = 2e~*) [/,, this shows that one has 


=m “ 
= ——— cs a(x) 


neven 


Pint 2)r(nt §) 


n!} 


Aig(x) ~ 


(3/4)"x"? (92) 


It follows that for any m > 0 the ratio of the left hand side with the right hand side 
truncated at n < m is of the form 1 + O(x 7) for x < 0, x — —oo as above. 
Similarly one shows that (Fig. 5) 


—1/4 5 
Ais (x) ~ Ae ees > oes lc 8) 3 /4)"(— 1)e-Y/2(-4)-# 
2n*/? nodd ! 

(93) 
in the above strong sense. Notice that the two equivalences (92) and (93) are stronger 
than the original one (89) for Ai(x). In particular they determine exactly the positions 
of the zeros of Aij;(x) for x << 0 as the x, = —437/3(—m + 4nm)*/3for Aig and 
Yn = —43°3 (x + 4nr)?/?for Ai;. On the other hand the zeros of the Airy function 
are not given by an elementary formula. Moreover even the overall sizes of the two 
terms Aij(x) are not the same since while Aig(x) is of the order of (—x)7'/4 the 
function Ai; (x) is of the order of (—x)~7/*. 


Fig. 5 The ratio of Ai; (x) 1.000000 
with its approximation using 
the first four terms of the 0.999999 


asymptotic series 
0.999999 F 


0.999998 - 
0.999998 - 


0.999997 - 
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7.1 Strong Asymptotic Expansion of Se, 


We now work out the details of the stationary phase method, first for the asymptotic 
expansion of /; a= + (Ai(x) + iBi(x)) (when x < 0). We perform a change of 
variables in 


‘ 1 - i( 5 +25) 
Ai(x) = — i e\? “J ds (94) 
27 Jase 
and let x = —w? with u > 0 and s = u3t. We then get 
1 poo 3 
Ay = [ etl) ay (95) 
20 J ees 


and we are looking for the expansion when u — +oo. After the above change of 
variables the two critical points correspond now to tf = +1. For the path C, we take 
the path, in the complex domain, through the critical point tf = 1 and such that the 


real part of c — t remains constant (equal to —4) along the path. In this way the 


variation of the phase will only come from the term dt. With tf = € + in one has 


~ —&P -&+5=0 
which is given by the formula, valid for € > 0, 


lad) BE 
V3J—E 


Along the path C2 one has the equality (Fig. 6) 


3 = /4 
i(5 f) = 3 bet w(é) = 2(-14+ €£)(2 + €)'/41 4+ 28) (96) 


3 2 39/4€3/4 


and w(&) varies from —oo (for § = 0) to +00 for § > oo. One needs to compute dt 
and one has 


1+é4+& 


/383/2 J2+E 


dn/d§ = (97) 
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Fig. 6 The path C, 


and 


1+ 3& + 28 
31/4E7/4 (2 4 §)3/4" 


dw/dé = 


For & + 0 one has 


3 


we Get, dnfdt ~ 6-28-12, dw/dé ~ 3-42-4674 
so that 
d&/dw ~ ci|w\~4, dn/dw ~ colw 
For & > +00 one has 
w~4x 3-483, dn/d&~ 3-2, dw/dé~2x 3-483 
so that 
dé/dw ~ cs|wl3, dn/dw ~ calwi3. 


We can now justify the asymptotic expansion of /, Co" 


(98) 


(99) 


(100) 


(101) 


(102) 
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Lemma 7. When u — +00 one has 


ee -1/2 & 3 1 ; 
ay Ms ye ee 0a 
C) 2/0 ar n! 4 
Proof. Using (96) we have 
iu( 5-1) 2 = 2 
; e\3 Jdt= ete f e”'? o(w)dw (104) 
C. —oo 


where the function g(w) is given by 
8(w) = gil(w) + igo(w) = d&/dw + idn/dw (105) 
where one expresses & as a function €(w) of w € R. The two functions gj(w) are 


smooth and O(|{w|°) when |w| —> oo by (100) and (102). The Taylor expansion of 
g(w) at w = Ois of the form 


w) 1 i\_iw 5 Si oe 385 ¢ 385i TOLLS 
Ww) = =| = = WwW = WwW ete 
2 2) 6 \% % 27 \27648 27648 648 


and the even part +(g(w) + g(—w)) takes the simpler form 


1 
5 (8) + g(-w)) = 


l+i ; Siw~  385w4 17017iw? | 1062347" 
2 48 13824 1990656 382205952 


In fact one has an equality of the form 


1 a : 

5 (8) + a(—w)) = €'* (how) + ih (w)) (106) 
where /o(w) is the real part of e~'4 5(g(w) + g(—w)). By construction both h; are 
smooth even functions and O(|w|°) when |w| — oo. Moreover hj(w) = kj(w?) 
where again both k; are smooth, ko is even and k, is odd. 

Let 


file) / © eh (w)d / © ela) (107) 
i(u) = e i(w)dw = e j(v) —=. 

fi Dae j , are 

The asymptotic expansion of f;(u) for u — oo follows directly from the Taylor 
expansion of h;(w) at w = 0 (or of kj(v) at v = 0, using e.g. Watson’s Lemma). It 
is given by the well known explicit formulas 


aa 2P(at arnt 3) 3 a. Kh 
Pe = (pu (108) 


neven 


Sow) ~ 
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and 


n—1 i + T + 3 
flu) ~ =r ia a li JC ae (109) 
SL SS - 
Thus since by (104), 
i Me) y= e439 fy(u) + ifiW)) (110) 
C2 
one obtains (103). oO 


The above asymptotic expansions hold in the classical sense as defined by 
Poincaré. We shall now see that when one passes to the real part an interesting 
phenomenon occurs. We first need to define more precisely the notion of strong 
asymptotic expansion. 

Let us consider, for a > 0 the following multiplicative subset of functions of the 
variable u. 


Gq = {h| hu) =14+ 0%) for u— +00}. (111) 
Definition 7. Let f(u), t,(u) be functions of the positive real variable u. The 


expansion f(u) ~ S~?° t,(u) is called a strong asymptotic expansion when for any 
a > O there exists ny such that 


Fe (24) Go, Wn > Ng. (112) 
1 


Proposition 11. For x € R, x < 0, there exists a decomposition 
Ai(x) = Aio(x) + Air (x) (113) 


as a sum of two real analytic functions of x with strong asymptotic expansions 


oe 2 Pan+32rat+h _ 
Aio(—u3) ~ Sanco — 34) vey”? = 63/4)" 
neven (114) 
: 2 yu !/6 : =) (n+ 2)0(n+ z) eh 
Aiy(-u!) ~ ——5sin(t - 39 Den 1) a (3/4)"u-". 
(115) 


Proof. One has, using (95), and deforming the path (—oo, co) + ie into the union 
of two paths C; as in Fig. 7 
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Fig. 7 The paths C; 


ng 
Ai(—u3) = —> , ei) ap (116) 
j 2) 


The symmetry s(t) = —f transforms C) into C, but reverses the natural orientation. 


One has 
0(%-«)- CE) 


and thus the terms /, Cj in (116) are complex conjugate. With the notations of (107) 
we define 


1 
3 2 
Aig(x) := dae pe — =u)fo(u), Vx = =u3 (117) 
a 4 3 
and 
ua xz 2 
2 
Ai, (x) := —sin(— — =u)f,\(u), Vx = —u3. (118) 
XN 4 3 


By (116) one has 


Ai(—u3) = <n ( 1 cs) a) (119) 
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and taking the real part of both sides of (110) one gets the decomposition (113). 
Using (108) and (109) one obtains the strong asymptotic expansions (114) 
and (115). Oo 


Proposition 12. There exists an element h of the algebra Be o= Bey @r C such 
that, for any u > 0 one has 


Ai(—u3) = ui R (e\F-}n@w)) (120) 


Proof. Let h(u) = +(fo(u) + ifi(u)), then by (119) and (110) one has (120). It 
remains to show that each f; belongs to ane By (107) one has 


dv 
Jv 


where the function 4;(v) is smooth and of polynomial growth at oo. It follows that 
the measure dj = kv) is a Radon measure such that [ n e*” |duj| < 00 for 


flu) = i; e 2k (vy) (121) 


any a > 0 and one obtains the conclusion using (121) and Definition 6. oO 


7.2 Source Term and Perturbative Treatment 
of the Airy Integral 


In this section we investigate what happens if we treat the Airy integral by 
introducing a source term and by performing perturbation theory around a Gaussian: 
this is a familiar method in the theory of Feynman integrals. We consider the Airy 
integral in the form 


F(u) = uw} Ai(-u3) = =f. 


—oo 


and we introduce a source term 
1 ts iu Bits 
F(u,j) = =| eit 1431) ay (122) 
27 Joo 


in order to understand the relative roles of the variables u and j. One has, for t = 


ys 
p 3 
iu (5 -1+it) = iu(1 —j)*/? (5 -s) 


so that forj < 1 one gets 
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F(u,j) = (1 —j)'?F@( —j)*”). (123) 


The formula (123) gives us, for 7 < 1, fixed the control of the behavior of the 
integral (122) when u —> oo. 

Next, we compare this with the perturbative method around a critical point. We 
choose a critical point for the action without the source, we take t = 1 and write 


t = 1+ @. We are then dealing with the exponent iu (-3 +j+j¢+ ~ + +), 
and thus with the integral 


. . 1 OO iu a uae 
F(u,j) a / e (0 3 *) a9, (124) 
—cCoO 


We now see how this integral is treated in the perturbative manner. One first 
introduces a coupling constant g in front of the interaction term. The reason for 
doing that is to be able to proceed by integrating against a Gaussian. When g = 0 
the integral is Gaussian and one then expands around g = 0 to obtain the result in 
general. Thus one deals with 


: <: il OO (62+ Pai 
Flujig) = eben f e (#48 i) ap (125) 
DIC J tees 


so that F(u,j) = F(u,j, 1). One treats g as small and one looks for an asymptotic 
expansion in powers of g. Since the interaction term is of higher order, we get the 
equation Wy = Legendre(S). The change of variables is that u = 7 and the action 
S(@) is given by 
> 

SG) = 6 +875. (126) 
One computes the Legendre transform of S perturbatively. One has two solutions of 
the equation 6S/d5y = —j which are given by 


_-lt JI-g 
g 


v 


and the solution which is selected by the perturbative expansion is 


V4 + Oj)’. (127) 


_-l+ I-g ji sf ef Ses 
g 2 8 16 128 


One thus gets at the perturbative level 


Wo(i) = S(W+) + I+ 


56 A. Connes and C. Consani 


ee iu(—2-4j : ; 
and taking into account the term e““~3*) one gets the following evaluation for the 
exponent 


i +j+ SQy4) + jv+] =i ae Ee + = ( aa ’) P+ OG)” 
iu\ —= = iu 
qd Jj ae a ae oe gts) 


where in closed form one has 


(-1+ Via) (36% +32 (-1+ vI=a) +(-1+ I=’) 


3 7 


SQW) +iv+ = 36 


Taking g = 1, the above expression simplifies and one obtains 


2, 2 3/2 
3 +jt+ S(W+) +ivt = =, =i) 


which gives the exponent 


iu (-F4 -9°) ; 


This shows that the perturbative expansion corresponds to taking the integral over 
the path C, in the expression of the Airy function and gives a strong asymptotic 
expansion of this term but of course it completely ignores the contribution of C, 
which is nevertheless essential. 

Let us now look at the non-perturbative behavior of the functional integral as a 
function of the source j. One uses the usual normalization which amounts to divide 
by the value at j = 0 and thus we consider 


: Oil 5 —ebit 1 eo ae . 
(—f_« (5 ar) 1 (= ix (5 ar) = F(u,j)/F(u). (128) 


One has, using (123), and forj < 1 


Ai(-C ~ fut) 


F(u,j)/F(u) = (1 —j)'? FC —j)*”)/F(u) = , (129) 
Ai(—u3) 


Since the denominator has many zeros which do not correspond to zeros of the 
numerator one obtains a function which oscillates wildly between the poles as 
shown in Fig. 8. 

We can thus summarize the treatment of the integral with a source (122) as 
follows 


; 1 iw § tit) 
F(u,j) = a e \: dt (130) 


CO 
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Fig. 8 The graph of 6b 
lL for j = 
Ai(—u3 ) 
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1. The quotient F(u, j)/F(u, 0) has infinitely many poles in u when j 4 0 and its 
logarithm does not make sense. 

2. This behavior is due to the presence of two critical points which each contribute 
by a wave without any coherence between the two waves. 

3. The perturbative treatment chooses one of the critical points and makes an 
expansion of the contribution of this critical point, thus giving this term up to 
strong equivalence. 

4. The presence of several critical points forces one to add the contributions of each 
critical point and replaces the exact knowledge of each of these terms up to strong 
equivalence by a hypersum in the quotient hyperfield. 


Thus one can conclude that the way the perturbative method disregards the problem 
of the hypersum relies in the fact that it does not compute the full integral but only 
a portion of it corresponding to the choice of a critical point. Clearly, a complete 
understanding of the process requires to consider the full integral by add up the 
various contributions, therefore the appearance of the hypersum cannot be avoided. 
In the context of gauge theories in physics, the presence of several critical points 
is unavoidable and for this reason we expect that the formalism deployed by the 
theory of hyper-structures (hyperrings and hyperfields) might shed some light on 
the evaluation of Feyman integrals in that context. 


7.3. A Toy Model: The Hyperfield C’ 


The choice of a critical point in the asymptotic expansions which one performs 
in quantum physics to interpret the result in a classical manner is guided by the 
Wick rotation whose effect is to replace an integral of imaginary exponentials by an 
integral of real exponentials. This process is justified in quantum field theory where 
one then rotates back by analytic continuation from the Euclidean formulation to 
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the Minkowski space physical description. This suggests to select the critical points 
using the following total ordering on C. Let C+ C C be defined by 


Cy = {z EC | R(z) = O and 3(z) = 0 if R(z) = 0} (131) 
We write z < 2’ for z — z € Cx. This defines a total order relation compatible with 
addition. 
7.3.1 The Hyperfield C° 
As a set C? = (C|_]C) U {0} is the union of two copies of C and {0}. We write its 


non-zero elements as € e(z) where « = +1 and z € C. The multiplicative structure 
is defined by €e(z) - €’e(z’) := €€’e(z + Z) and the additive structure is given by 


€ e(z), if7 <z 
t ale! €' eZ’), ifz<Z; 
7 132 
Sea € e(Z) ifz=z,e=€'; (2) 
{0} U {e%e(2") | 2" < gj, ifz=z,e=—€'. 


One checks that these laws define a hyperfield structure on C”. Moreover as in the 
real case one obtains 


Proposition 13. The hyperfield C’ is perfect of characteristic one. 


7.3.2. Description of C’ as Perfection of C 


We shall use an analogous formula as in the real case where we take kK = 3 in 
Theorem |. The only difference is that in the complex case we require that the 
sequence (x) jez of complex numbers x” € C, is doubly infinite and convergent 
when j — —oo. This nuance makes no difference in the real case since any sequence 
(x) +0 of real numbers such that x4*) = (x)? , Vj, uniquely extends to a doubly 
infinite sequence (x) jez fulfilling x¥*) = (x), Vj, and the obtained sequence 
is automatically convergent when j — —oo (and its limit belongs to {—1, 0, 1}). 


Theorem 9. One has a canonical isomorphism of C’ with doubly infinite sequences 
x € Cas follows 


c x {x )iez, convergent for j — —oo | Vt) = (x) , We Z} (133) 


The map _,/ associates to 0 the sequence x9 = 0 and tox = €e(z) € C the 


sequence x) = € e*, 
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Proof. The map ,/ which associates to x = € e(z) € C? the sequence x” = € ee 
is well defined since (x)? = x*) as 3 is odd, while x” — € as j > —oo. We 
show that the map ,/ is bijective. It is injective because the sequence see 
determines both € and z by the equalities 


e= lim x9, z= lim 34 (ex —-1). 


Jr-o jJr-o 


Let now x be a doubly infinite sequence of complex numbers as in (133). If x 
0 then all x are 0. Assume that x 4 0 and let yp. = |x|. Then one has |x‘ 
pl/* —> 1 when j > oo. The limit ¢ = limj+—oo x”) fulfills « 4 0 and 


II 


II 


2 = lim?) = limx” =e 


so that € = +1. Replacing x by —x we can assume that € = 1 ice. that x) > 1 
when j — —oo. Since x — 1 one has |x” — 1| < 1 for j < jy and thus 


xD = e"" Vj <j, 2 = log(x™) (134) 


where log(x%) is defined by the convergent series 


oO ; 
; 1 — xo)y2 
log) = — ) Se : 
n 
1 


Thus one gets, with z = 3~zq the equality x? = ee, Vj, and hence the 


surjectivity of |/. Oo 
It is important to have an explicit formula for the natural extension of the 


sequence x") to a continuous, one parameter family x(t) € C,t eR. 


Corollary 2. Let x” be a non-zero sequence of complex numbers such that 
(x)? = x9*) for all j € Z and which is convergent for j > —oo. There exists 
a unique continuous one parameter family x(t) € C, t € R such that 


— x3)=x9, VieZ 
— x(kt) = x(t)‘ for all odd k € Z. 


Let € = limj+—oo x). Then one has for any t > 0 


x(t) = | [(ex)%, Va; € {0,1,2}, Dog ¥ at. (135) 


Proof. The existence of the x(¢) follows from Theorem 9. Its uniqueness follows 
from the density in R of the a3~* where a € Z is odd. To prove (135) one can assume 
that € = lim x” is 1. One then has x(t) = e” for some z € C and (135) follows. Note 
that the infinite product is absolutely convergent since )> j<0 lex — 1] < oo. oO 
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The product of two convergent sequences is convergent and thus it is immediate to 
get the product of two elements of C” from their representation as doubly infinite 
sequences 


(jez. jez = &% y)jez. (136) 
For the addition, the natural formula to try is (as in the p-adic and real cases) 
(x+y) = lim OH + yO (137) 
J7o 


however, one needs to handle here the ambiguity in the extraction of roots of order 
a power of 3. When |x| > |y| this is easily done since, for j > 0 


xD 4 yD = CH (1 + (9 /2)") 
while for j —> oo one has, since |y /x| < 1 
: eat O09 
(1469/2) 1 (138) 
using the unique extraction of roots in a neighborhood of 1. Thus this gives 
Gey? =x it x] > y|, (139) 


What is new in the complex case is that in the case when |x| = |y| (i.e. when 
the two sequences have the same modulus) it is the behavior of x(t) + y(t) on the 
imaginary axis (i.e. for it — +00) which allows one to get the hypersum. The 
required analytic continuation in the parameter ¢ connects with the Wick rotation 
of quantum physics. Note that there is nevertheless also a direct manner to decide, 
assuming y 4 —x, which between x and y is the hypersum x + y, this is achieved by 
considering the behavior of the sequences (x) + and (vy); for j > —oo. When 
|x| 4 |y| it is the sequence of largest modulus. When |x| = |y| one considers the 
sequence 


u(j) = x” fy, j= —O, 
of complex numbers of modulus | and the hypersum is x (resp. y) when the sequence 
rotates in a clockwise (resp. anticlockwise) manner for j — —oo. 
7.3.3 Cas the Quotient of C by the Euler Relation e’” = —1 


We show that C” appears naturally as the perfection of the hyperfield .7C of Viro 
tropical complex numbers (cf. [25]). The multiplicative structure of ZC is the same 
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as for ordinary complex numbers and we recall the definition of the hypersuma ~ b 
in the case of Viro tropical complex numbers. One sets 


1. If ja] < |b]: a~ b=); if jal > |bl: aH b=a. 
2. If |a| = |b| anda 4 —b, with a = re, b = re'® and|a — B| <x 


a~ b = {re'*| |a— | + |p — B| = |a — BI} 


3. Ifa+b=0: aw~ bis the closed disk {c € C | |c] < |al}. 


The hyperfield .7C is not perfect. This conclusion is obvious since the map x > x” 
is not bijective say forn = 3. 


Proposition 14. (i) The following map ev : C? > FC is a hyperfield homomor- 
phism 


ev(€e(z)) := €e&, Ve € {41}, zEC, ev(0) = 0. (140) 


Moreover (with the notations of Theorem 9) one has ev(x) = x for allx € C’. 
(ii) The hyperfield homomorphism ev is surjective and at the level of the multiplica- 
tive groups one has the exact sequence 


1> (elin))? 3 C* S c¥ 1, (141) 


Proof. The map ev : C? -> ZC is multiplicative, we need to check that it is 
compatible with the hyperaddition. Let x = €e(z), x = €e(z’), we show that 


ev(x + x’) C ev(x) ~ evi’). (142) 


Assume first that 3t(z) < 9(z’). Then one has 7 — z € Cy and thusx +x = x’ 
by (132). One has Jev(x)| = e*® and thus |ev(x)| < Jev(x’)| so that ev(x) ~ 
ev(x’) = ev(x’). This shows that (142) holds when H(z) 4 N(z’). Assume now 
that (z) = R(z’). Then |ev(x)| = |ev(x’)| and the definition of the hypersum ~ 
shows that in this case ev(x) ~ ev(x’) D {ev(x), ev(x’)}. This shows, using (132), 
that (142) holds when x’ 4 —x. Assume now that x’ = —x. Then ev(x) = —ev(x’) 
and ev(x) ~ ev(2’) is the closed disk {c € C | |c| < |ev(x)|}. With x = € e(z) one 
has 


x+x = {OF U {e"e(z”) | 2! <p zh. 


But z” <p z implies R(z’) < 2(z) and hence |ev(e”e(z’))| = e*@) < e#O = 
lev(x)| so that ev(e€”e(z’)) belongs to the closed disk {c € C | |c] < Jev(~)j}. 
We thus get (142) in this case also and this shows that the map ev is a hyperfield 
homomorphism. 

For the second statement note that the map ev is a group homomorphism os 
C* and its kernel is the cyclic group generated by the element —e(i) € C’. oO 
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7.3.4 Universal W-Model of C” 


The hyperfield C’ admits a universal W-model. Given finitely many elements yeC 
we denote by Vz; the unique largest element for the total order associated to C1. 
The following formula defines a homomorphism p from the group ring R = Q[C] 
toc 


p( >, ayeju(zj)) = € eZ), Ze = Vz; (143) 
1 
which extends to a homomorphism of hyperfields from the field K = Frac(R) to C’. 


Theorem 10. The triple (W = K,p, tw) is the universal W-model for H = C’. 


The homomorphism p induces an isomorphism of hyperfields W/G = ©, where 
G = ker(p : WX + C”). 


Proof. The proof is similar to the proof of Theorem 2 and is left to the reader. O 


7.3.5 The Map 6¢ and the Ring Cy, 


We proceed as in the real case and construct the universal formal pro-infinitesimal 
thickening of the field C. Theorem 10 gives not only the field W(C’) but also the 
subalgebra Wo(C’) generated by the Teichmiiller lifts [x] for x € C’. 


Proposition 15. There exists a unique ring homomorphism 6c : Wo(C’) > C such 
that ({ ] = tT) 
6c([x]) = Oc(t(x)) = ev(x), Vx eC. (144) 


Proof: By construction Wo(C’) is the subalgebra (over Q) generated by the 
Teichmiiller lifts [x] for x € C?. With x = ee(z) one has [x] = €u(z) and thus 
one gets that Wo(C”) = Q[C]. Thus the natural map u(z) +> e* extends by linearity 
and uniquely to a ring homomorphism 


Ac(> > aj[xi]) = Y- aev(xi) eC. (145) 
Oo 


As in the real case, this suggests to consider the homomorphism 6c : Wo(C’) > C 
of Proposition 15 and introduce the following 


Definition 8. The universal formal pro-infinitesimal thickening Coo of C is the 
Ker(c)-adic completion of Wo(C’), i.e. 


_.: b n 
Coo = lim Wo(C")/Ker(6c)". 


n 
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We shall now proceed as in the real case to show that Ker(@c)/Ker(6c)* is an 
infinite dimensional complex vector space. Our main goal will be that to construct 
explicitly a two dimensional complex space of linear forms on Ker(@c)/Ker(c)*. 
We introduce the following vector spaces over C 


Definition 9. We let Homz(C, C) be the complex vector space of all additive maps 
L:C-—= C, and Homg(C,C) C Homz(C, C) the two dimensional subspace of 
R-linear maps. 


One has by definition 
(ad + bw)(z) := ad(z) + bW(z) € C, Va,b,zE C, ¢,W € Homz(C, C). 


Note that Homey(C,C) C Homz(C,C) is also the subspace of additive maps 
which are measurable and that it is only by the virtue of the axiom of choice 
that Homz(C, C) is infinite dimensional, while only the elements of the subspace 
Homa(C, C) can be concretely exhibited. We write the elements of Homg(C, C) in 
the form 


L(z) = az+bz, VzEC (146) 


so that (a, b) are the natural coordinates in this complex vector space. 


Lemma 8. (i) Let £ €¢ Homz(C, C), then the map 


F(X) = Yo aye" | WX = \° aiulzi) € Wo(C) (147) 
defines a ring homomorphism % : Wo(C’) — & to the ring of entire functions 
of the variable u € C and 


Oc(X) = F(X)(0), VX € Wo(C’). (148) 


(ii) Let £ € Homz(C,C). Then the following formula defines a linear form on 
Ker(@¢) /Ker(8c)? 


Ker(6c) 3 X b 6¢(X) = (F200) (149) 


u=0 


Proof. (i) For each u € C the map z+ e+" js a group homomorphism from 
the additive group C to the multiplicative group C*. Thus this map extends to a 
ring homomorphism from the group ring Wo(C”) = Q|C] to C. This shows that 
& is a homomorphism to the algebra of functions with pointwise operations. 
Since %(X) is a finite linear combination of exponential functions of wu it is an 
entire function. One checks (148) using the definition (144) of Oc. 
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(ii) First the right hand side of (149) vanishes when X € Ker(@c¢)? since the entire 
function %(X)(u) admits a zero of order at least two at u = 0 as can be 
seen using (148). This shows that 6¢ is well defined. It is clearly additive. 
Let us show that it is C-linear. For the structure of complex vector space on 
Wo(C)/Ker(@c) = C, the multiplication by a complex number y € C is 
provided by the multiplication by any s € Wo(C’) such that 6c(s) = y. We 
then have, with X € Ker(6c), the expansion at u = 0 


Tu(sX)(u) = Fils)(u) F(X) W) = Oc(s)5e(X)u + Ow’). 


This shows that 57 is C-linear. oO 


7.3.6 The Periods € and x, 
As in Theorem 3 one can use Lemma 8 to show that the “periods” of the form 
1p = [e(log p)] — p are linearly independent elements of Ker(6c)/Ker(6c)?. Next, 


we construct another “period” which is purely complex. We start with the analogue 
of the element € € F(C,) of the p-adic Hodge theory. We define in our case 


e:= e(2in) EC’. (150) 

The natural square root e of ¢ is e&@ = e(im) € C? and one has 
Ac([e]) = 1, w € Ker(6c), w = ([e] - 1)/(e™] - 1). (151) 
The last part follows from w = 1 + [e] and the fact that e’” = —1. Now that we 


have these various “periods” we can evaluate on them the natural linear forms 5¢ on 
Ker(6c)/Ker(@c)? given by elements of Homg(C, C). 


Lemma 9. For L € Homg(C, C) given by (146) one has 
5 (ap) = (a+ b)plog(p), 8:(w) = in(b—a). (152) 


Proof. Let £ = L with L given by (146). One has 


51(ttp) = (=) , FJi([e(log p)] — p)(u) = (=) 7 grepruitser) = pi(logp) 


which gives (a + b)p log(p) by (146). Similarly 


iu) = (=) Fi + ew) = (=) eit bull) — 1 (in) 
u=0 u=0 


which gives im(b — a) by (146). Oo 
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Appendix 1 


The following table reports the archimedean structures that we have defined and 
discussed in this paper and their p-adic counterparts (cf. [9]). 


p-adic case Archimedean case 


sign = {—1,0,1} hyperfield of signs 


FR =R CC =F(C) 


€1= e(2in) € C’ = F(C) 


6 = lim [1,1] C lim {kd <3 


Xb >xk zh zk 


tot = fF) = Jo oF dul) | u finite 
real measure} 


faj, fetavetes £5 =F ie 
s<t 


Wox(Or) = fx € B+ |x =D boln"} (fe BES | Iifllo < = fF € BES Lf = 
n= Ie tfle*as} 


GC >> bale") = DO bd] B,(f° [ilem'ds) = f° iftleds 


n>>—oo n>>—oco 


[xlp = max|an|“q” Ill = J" ilteas 


Appendix 2 


In this appendix we give a short overview of the well-known construction of 
universal perfection in number theory: we refer to [10], Chap. V, Sect. 1.4; [11, 26], 
Sect. 2.1; [9], Sect. 2.4 for more details. 
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The universal perfection is a procedure which associates, in a canonical way, 
a perfect field F(L) of characteristic p to a p-perfect field L. This construction is 
particularly relevant when char(L) = 0, since it determines the first step toward the 
definition of a universal, Galois equivariant cover of L (cf. Appendix 4). 

We recall that a field L is said to be p-perfect if it is complete with respect to a 
non archimedean absolute value | |,, L has a residue field of characteristic p and the 
endomorphism of @,/p@_,, x — x? is surjective. Furthermore, the field L is said to 
be strictly p-perfect if @, is not a discrete valuation ring. 

Starting with a p-perfect field L, one introduces the set 


FO =G= 6 ren a ER GP y ax). (153) 
If x, y € F(L), one sets 
(x + yy” = Jim ar” + yortm)yp”. (xy) = xy, (154) 


We recall from [9] (cf. Sect. 2.4) the following result 


Proposition 16. Let L be a p-perfect field. Then F (L) with the above two operations 
is a perfect field of characteristic p, complete with respect to the absolute value 
defined by |x| = |x|:. Moreover if a C my, is a finite type (i.e. principal) ideal 
of G1, containing pGy,, then the map reduction mod. a induces an isomorphism of 
topological rings 


Ory — limO,/a, x= Open > ¥ = (Xmod. dren (155) 
neN 


where the transition maps in the projective limit are given by the ring homomor- 
phism x — x?. 


In other words, the bijection (155) allows one to transfer (uniquely) the natural 
(perfect) algebra structure on lim O,,/0 over the inverse limit set Orr) = lim COL 
v—>vP Xt> xP 
of p-power compatible sequences x = (x),50, x” € @,. Indeed, one shows that 
for any v = (U,) € lim O,/a and arbitrary lifts x, € Oy of vu, € O,/avn => 0, 
U—>vP 
the limit x = limp+oo a, 4m €xists in On = 0 and is independent of the choice 
of the lifts x,. This lifting process is naturally multiplicative, whereas the additive 
structure on lim G,/a lifts on Oz) as (154). 


v—>vuP 


m 
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Appendix 3 


In this appendix we provide, for completeness, a proof of Proposition 2. We recall 
that a pro-infinitesimal thickening of a ring R (cf. [12], Sect. 1.1.1 with A = Z) is 
a surjective ring homomorphism @ : A — R, such that the ring A is Hausdorff and 
complete for the Ker(@)-adic topology i.e. 


A = limA/Ker(0)". (156) 


n 


As a minor variant, we consider triples (A,6,t), where 0 : A — R is a ring 
homomorphism with multiplicative section t : R — A and condition (156) holds. 

A morphism from the triple (A;, 0;, T,) to the triple (A2, 92, T2) is given by a ring 
homomorphism @ : A; — Ao such that 


Mm] =aoT, 0; = 00a. (157) 


Let R be a perfect ring of characteristic p and let W(R) be the p-isotypical Witt ring 
of R. Let pr : W(R) — R be the canonical homomorphism and tr : R > W(R) the 
multiplicative section given by the Teichmiiller lift. 

By construction one has Ker(pg) = pW(R) and condition (156) holds. 

We show that for any triple (A, p, t) fulfilling (156), there exists a unique ring 
homomorphism from (W(R), pr, TR) to (A, p, T) (Compare with Theorem 4.2 of [15] 
and Theorem 1.2.1 of [12]). The ring A with the sequence of ideals a, = Ker(p)” 
fulfills the hypothesis of [22] (II, Sect.4, Proposition 8). Thus it follows from [22] 
(II, Sect.5, Proposition 10) that there exists a (unique) ring homomorphism @ : 
W(R) — A such that po a = pr. Moreover the uniqueness of the multiplicative 
section shown in [22] (II, Sect. 4, Proposition 8) proves that one has tT = @ © Tr. 
This completes the proof of Proposition 2. 

Next we show that the notion of thickening involving a multiplicative section t 
is in general different from the classical notion. 

Consider R = Z. Then, for any surjective ring homomorphism @ : A — R, the 
map Z 3 nt nl, is the unique homomorphism from the pair (Z, id) to the pair 
(A, 6). It follows that the pair (Z, id) is the universal pro-infinitesimal thickening 
of Z. This no longer holds when one involves the multiplicative section T. 

Given a ring R, we consider R-triples (A,p,t) where p : A — R is a ring 
homomorphism, t : R — A is a multiplicative section (i.e. a morphism of monoids 
such that t(0) = 0 and t(1) = 1) and one also assumes (156). A morphism between 
two triples is a ring homomorphism @ : A; — Az such that p) = p2 0a and 
T =QAOoT. 

Proposition 2 shows that when R is a perfect ring of characteristic p there exists 
an initial object in the category of R-triples. For R = Z the triple (Z, id, id) is a 
Z-triple but it is not the universal one. The latter is in fact obtained using the ring 
Z[[{6p}]] ® (Z ® Ze) of formal series with independent generators 6, = [p] — p, 
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for each prime p and an additional generator e = [—1] + 1 such that e* = 2e. The 
augmentation defines a surjection « : A > Z, p(e) = 0, and there exists a unique 
multiplicative section t, t(1) = 1, such that 


t(p) =p+6,, Vp prime, t(-1) = —-l+e. (158) 


Proposition 17. The triple (Z[[{6,}]] ® (Z ® Zze), €, ) is the universal Z-triple. 
The map 


D:Z-— Ker(e)/Ker(e)?, D(n) := t(n)—1n (159) 


fulfills the Leibnitz rule and its component on 6, coincides with the map i :Z>Z 
defined in [18]. 


Proof. By construction one has 6, € Ker(e) and thus € o t = id. Consider first 
the subring Z[{5,}][e] freely generated by the 5, = [p] — p for each prime p and an 
additional generator e = [—1]+1 such that e* = 2e. Given a Z-triple (A, p, T), there 
exists a unique ring homomorphism 


a: Z[{6,}][e] > A, a(5p) = t(p)—p, a(e) = t(-1) + 1. (160) 
This ring homomorphism extends uniquely, by continuity, to a homomorphism 


a: lim Z[tps][e]/Ker(e)" -A= lim A/Ker(p)" 


n 


and this shows that (Z[[{6,}]] ® (Z ® Zze), €, T) is the universal Z-triple. 
The second assertion follows from [18] (cf. Theorem 1) and the identity 


[nm] — nm = ([n] — n)m + n([m] — m) + ({n] — 1)((m] — m), Vn,m € Z. 


Appendix 4 


In this appendix we shortly review some relevant constructions in p-adic Hodge 
theory which lead to the definition of the rings of p-adic periods. The main 
references are [8, 9]. 

We fix a non-archimedean locally compact field K of characteristic zero with a 
finite residue field k of characteristic p: q = |k|. Let vx be the (discrete) valuation 
of K normalized by vx (K*) = Z. 

Let F be any perfect field containing k. We assume that F is complete for a given 
(non-trivial) absolute value | |. By W(F) and W(k) we denote the rings of isotypical 
Witt-vectors. 
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There exists a unique (up-to a unique isomorphism) field extension €-,x of K, 
complete with respect to a discrete valuation v extending vx such that: 


© v(GE x) = ux(K*) =Z 
¢ F is the residue field of Cr. x. 


One sees that €7 x can be identified with K @w«) W(F). Thus, if z is a chosen 
uniformizing parameter of K, then an element of €f x can be written uniquely as 
© = Diss—oo lAn]", Qn € F. In particular e € K if and only if a, € kVn. 

Let 6¢,, be the (discrete) valuation ring of Er.x. Each element of @¢,, can 
be written uniquely as )~,,.9[dn]a", Gn € F. The projection map C¢,, —> F 
has a unique multiplicative section i.e. the Teichmiiller map a +> [a] = 1 @ 
(a,0,0,...,0,...). 

There is a universal (local) subring Wo,(@r) C Oe,, which describes the 
unique m-adic torsion-free lifting of the perfect Ox-algebra Or. If Ko denotes the 
maximal unramified extension of Q, inside K, there is a canonical isomorphism: 


Ox ® 0x, W(Gr) —> Woy (Gr), 1 [alr © [al. 


If A is any separated and complete z-adic @x-algebra with field of fractions L and 
F = F(L) (cf. Appendix 3 for notation), there is a ring homomorphism 


0: Wo(Gray)) >A, Obl" SOx a". (161) 


n>0 n=0 


In the particular case of the algebraA = Of = Of(cx) (Cx = completion of a fixed 
algebraic closure of K), the surjective ring homomorphism 6) : Gr —> O¢,/(p), 
Oo((x)n>0) = x lifts to a surjective ring homomorphism of @x-algebras 


0: Wo(Orcey) > Ocxe, YBa” YI xOn” (162) 


n>0 n=0 


which is independent of the choice of the uniformizer zr. 
The valued field (x, |-|) (|-| non discrete) contains two further sub-@x-algebras 
which are also independent of the choice of a uniformizer a € Ox. They are 


1 
BP* = Wo (Or[=] = = DY Balt” € Erin € Op, Vn} (163) 
n>—oo 


and if a € mp \ {0} C @r, the ring B’ := Bt] which can be equivalently 
described as 


P= B= = > bale” € Gexl3C > 0, lx C, Yah. (164) 
n>—oo 
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If a p-perfect field L contains K as a closed subfield, the ring homomorphism (161) 
extends to a surjective homomorphism of K-algebras 


0: Baye, >l, OC >> bale") = D0 On" (165) 


n>—oo n>—oo 


which is independent of the choice of z. If moreover L is a strictly p-perfect field, 
then |F(L)| = |L| and the kernel of the map @ in (165) is a prime ideal of Br Lx of 
degree one. One has 


Cae) =L and 0(Wox(@rw)) = OL. 


Let © = &,.x. Then the projection Op — kr, x — x induces an augmentation 
map 


e:BP+ + Ge D> blz") = SO fila” (166) 


n>—oo n>—oo 


with e(Wex(Gr)) = O6- E = (ony con)" | Xn € kr, Wn} is a local sub-field 
of B?*, 
One introduces for r € Ro the family of valuations on B?*: 


x= > [Xn ]”, v(x) = infrez{vu(%,) + nr} € RU {+00} 
n>—oo 


and defines B* as the completion of B+ for the family of norms (q~”’),s0 (¢ = 
[k|), r € v(F). 

An equivalent definition of these multiplicative norms is given as follows: for 
R 5 p € (0, 1] one defines 


lx|p = max|%n|p" (167) 


|xlo = q'", r smallest integer, x, 4 0; |x|1 = sup|x,|. 
neZ 


In view of the description of B? given in (164), the norms (167) are well-defined on 
the larger ring B’. The completion B of B? for these norms, as p € (0, 1), contains 
BtT al for any chosen a € mp \ {0}. 

B is the analogue in mixed characteristics of the ring of rigid analytic functions 
on the punctured unit disk in equal characteristics. 

The subalgebra Bt C B is characterized by the condition 


Bt ={beB| |b, < lh. (168) 
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The extension of rings Bt C Bt Tal gives a perfect control of the divisibility as 
explained in [8], Theorem 6.55. A remarkable property of this construction is that 
the Frobenius endomorphism on B?* 


yo: Bet => Bet v( a [xn]”) - » [x4]” 


n>—0o n>—0o 


extends to a Frobenius automorphism on B’ and on Bt thus to a continuous 


Frobenius automorphism ~ : B == 5 (i.e. the unique K-automorphism which 
induces x +> x4 on F) which satisfies |g(f)|p¢ = (If|p)%, Vo € (0, 1). 

The homomorphism (165) (in particular for L = Cx) extends to a canonical 
continuous universal (Galois equivariant) cover of L 


6:B—>L. 


Appendix 5 


In this appendix we explain the connection of the above construction with the 
archimedean analogue of the Witt construction in the framework of perfect semi- 
rings of characteristic one [3, 5]. Given a multiplicative cancellative perfect semi- 
ring R of characteristic 1, one keeps the same multiplication but deforms the addition 
into the following operation 


xtwy =) waxy’, = (0,I)NQ (169) 


ael 


which is commutative provided w(1—a@) = w(a), Va € J, and associative provided 
that the following equation holds 


_ a(1- 8) 
1—ap 
By applying Theorem 5.4 in [3], one sees that the positive symmetric solutions 


to (170) are parameterized by p € R, p > 1, and they are given by the following 
formula involving the entropy S, 


w(a)w(B)* = w(aB)w(y)") Va, B el. (170) 


w(a) = p®™, S(a) = —a log(a) — (1— a) log(1 —a@), Va € 1. (171) 
We apply this result to the semi-field R'* of tropical geometry. We write the 
elements p € R‘"", p > 1, in the convenient form p = e’ for some T > 0. In 


this way, we can view w(a) as the function of T given by 


wr(a@) = w(a, T) = eS Vael. (172) 
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By performing a direct computation one obtains for x, y > 0 that the perturbed sum 
X +, y is given by 

xt V= (CT $y)" (173) 
The formula (173) shows that the sum of two elements of R'*, computed by using 
wr, is a function which depends explicitly on the variable T. The functions [x](T) = 
x (for x € R4) which are constant in T describe the Teichmiiller lifts. The sum of 
such functions is no longer constant in T. In particular one can compute the sum of 
n constant functions all equal to 1: 


1 tyr 1 +, abe! +r 1= n (174) 


which shows that the sum of n terms equal to the unit of the structure is given by the 


function of the variable T: Tt n’. 


Proposition 18. The following map x is a homomorphism from the semi-ring R 
generated by the functions T +> a’, a € Q4, and the Teichmiiller lifts to the 
algebra of real-valued functions from (0, 00) to R+. with pointwise sum and product 


xP(T) =f(1)""" , WT > 0. (175) 
The range of the map yx is the semi-ring of finite linear combinations, with positive 
rational coefficients, of Teichmiiller lifts of elements x € R‘* given in the x- 
representation by 


x) (1) = x/7, VT > 0, Vx € RO (176) 


The following defines a homomorphism from R to Wo(R’), 


FOB”). BNO = x2) | (177) 


Proof. The proof is straightforward using [3]. oO 
Thus (177) gives the translation from the framework of [3, 5] to the framework of 


the present paper. 
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Moments of Zeta and Correlations 
of Divisor-Sums: II 


Brian Conrey and Jonathan P. Keating 


Abstract This is Part II of our examination of the second and fourth moments 
and shifted moments of the Riemann zeta-function on the critical line using long 
Dirichlet polynomials and divisor correlations. 


1 Introduction 


In part I, see [2], we completed the analysis of the second moment of the Riemann 
zeta-function using the long Dirichlet polynomial method of Goldston and Gonek 
[3] and we initiated the study of the fourth moment by this approach. In particular 
we calculated the contributions from the off-diagonal terms arising from coefficient 
correlations of the form }°,_, d(n)d(n + h) and identified the terms that are missed 
in this approach. In this paper we show how to evaluate these new terms that were 
missing and in doing so we introduce a new technique that is a discrete analog of the 
circle method. This analysis gives a concrete introduction to how we will approach 
higher moments through this circle method approach. In a subsequent paper we will 
show how to obtain the “full-moment” conjecture for the 2kth moment of ¢(s) on 
the critical line, i.e. the full polynomial of degree k* which comprises the main term. 
The idea for this method originates in the work of Bogomolny and Keating; see [1]. 

Thus, we will calculate the contribution of what we call the type II sums (after 
[1]) which arise in the evaluation of 


[OE 2a" 


m<X n<X 
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where s = 1/2 + it and tg(n) = =n d~“e~*, (See [2] for further notation and 
introduction.) To describe the type II sums we observe that integrating term-by-term 
we find that the above is 


Z ys sep OH (E log(m/n)) =TA+TO+E 


mnsx 


where @ is the diagonal 
* Tap (1) Ty,3 (1) 
9 =H) 
n<X 


C is the off-diagonal 


_ Ta,p(m)Ty,3(n) 
c= Tim v 


(5— loa(m/n)) 
4 


msn 
0<|m—n|<m/t 


and & < T* is anerror term; here t = T'~ and the Fourier transform is defined by 
Ho) = [ wanetwr) a 
R 


where e(x) = exp(2zix). 
If we evaluate @ here in the traditional manner, e.g., as in [3], we would now 
solve the shifted convolution problem which consists of evaluating 


Tw,p (1) Ty,6(n + h) 


nsx 


and summing by parts. This analysis was carried out in I. Here we use a new 
approach. We first make use of the fact that ty, and t,,5 are convolutions to write 


—a PV ,—6 
C= Ss m,“m, he te Fea 


m,my 20 


log((nn2)/(m mz)). 


mymynyny SX 
0<|mymz—njn2|<mymz/t 


Now we embark on a discrete analog of the circle method which basically consists 
of approximating a ratio, say m, /n,, by a rational number with a small denominator, 
say M/N, and then summing all of the terms with m,/n, close to M/N; finally we 
sum over M and N. 

To this end we introduce a parameter Q and subdivide the interval [0, 1] into Farey 
intervals associated with the fractions M/N with 1 < M < N < Qand (M,N) = 1 
from the Farey sequence Fg. The Farey interval .Z%y, determined by the fraction 
M/N is defined to be 
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M M+M" M M+M' 
My.n = = ; Se 
N N+N" N N+N' 
where we, 4 Me are three consecutive terms in the Farey sequence 9. Now given 


such an M and N we sum over the terms m, and n; for which m,/n, € @y.n; for 
such a pair we define 


hy = mN — nM. 
The possible range of h; may be computed by 


M M M-+M" n n 
nig (MMM) ne tl 


my 
a= |= 


N 


since adjacent denominators satisfy Q < N + N” < 2Q. In general, the rapid decay 
of v governs the range of h, and hp defined below. 

Also, we note that if Q is not too large then m,/n; € -@yn implies that 
ny/mz € My.n as well. This is because the distance from m/n, to n2/my is 


m n mm, — n\n mm 1 
A | AC a 
ny my nym) TNiM) tT 
On the other hand 
M M 1 
N N’ OQ? 


so if Q? = o(t) then our assertion follows. 
Now we define 


ho = mM — nN. 


We have 
mm2MN — nynoMN = him2M + homN — hyho 
so that 
mim, —NjN2 _ hy hy a hho 
mm mN mM mmMN 
and 
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The error term is negligible so we have now arranged the sum as 


—6 
MENEO hg max mm 2xm,N 2nm.M 
(M.N)= (&1).C#2) 


where 
(1) 7 mN zal nM = hy and (2) : mM — noN = h 


Note that for a given m,n, and h, the condition (*;) implies that m,/n, € yn 
so we don’t need to write that condition. 


2 Smoothing the Sums over M and N 


We introduce another smooth weight function ¢(y), which is an approximation to 
the characteristic function ¥(0,1}() to help with the summation over M and N. In the 
next section we will encounter sums of the form 


Son) := >> 6(5)o )o (F)er aNET ot 


(M.N)=1 


for a finite set of choices of € and n which are of the form 
ea + ep +e3y + €46 


where the ¢; € {—1, 0, 1}. For our weight function ¢ we require that 
60) = 55 [Blo as 
wi 
where #(s) has the properties that 
Res o(s) = 1 and (é) = 0 


for all of the eligible values of € that arise, and that $(s) is analytic in Rs > —1/2 
and has rapid decay vertically in this region. In practice Sg(&, 7) will be combined 
with So(n, &) to obtain 


So(6.n) +5a0n.8) = 9(4) + y 6(S)o(S)arew 
(M.N)=1 


Moments of Zeta and Correlations of Divisor-Sums: II 719 


The second term is 
[U(d) Md 4 Nd = 
YD ge L(G) : ‘Lo(B)s - 
d M N 
_yvr@ (if; Q\" 
- X Fern |, Hose 1+6) (2) aw) 


(af, $@)E( + 1 +0) (2) az). 


The first integral is = €(1 + €) + O((Q/d)~"/) as can be seen by moving the path 
of integration to the left to Rw = —1 /3 and accounting for the residue at the pole 
w = 0; note that since ¢(—&) = 0, there is no pole at w = —&. Thus, altogether we 
have 


+0(07'/3). (1) 


2 
So(E.n) + So(n.&) = ¢ (3) , SU + He +9) 


é2+é+%) 


3. The Case of h, = 0 


We remark first of all that the terms with hj = hy = 0 are precisely the diagonal 
terms. Now we consider what happens if hy = 0 and h, 4 0. We call this a “semi- 
diagonal” term after [1]. 
If hy = 0 then mM = mN. Since (M,N) = 1 it follows that m, = N& and 
= Me for some ¢. Thus we have 


iE yar) 


hy my nyt mm 
(1) 


(M.N)=1 
mln |0 


where 
(1) 3 mN —nM = hy. 


We replace m, by a smooth variable u; and n; by mN/M. We have 
u,LN = mm» < X and so our sum is 


X #(a)#(gerwre 


(M\N)=1 


= Th, 
OE eH fw (5B) aw 
2 dX 7a ¥ 27u;N 


uwyets 
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We save the term with h; = 0 for later and we group the terms with h, and —h, 


together and use W(-v) = W(v). We make the substitution v) = x y in the 


integral and switch the integral over v; with the sum over /, and £. Then (with 
hy > 0) we have that 


LNTh, 
= uylN < X 
27v,N 
implies that 
Lhy < rue 


Thus we have 


X (5) )e (aaa vy 7 2ng (v1) 


(M,N)=1 


‘. aaa da ae dv,. 


2nXvq} 
nes F— 


The sum over /, and £ is 


Cst+14+64+d0(stat+y) (=) a 


Qni (2) T 7 


Together with the integral over v, this is 


i vy Tri Osa f, C(st1 +P +S e(staty) (=) iy, 
0 S 


Now, as we’ve seen before, if Its > 0 then 


[exten dv = x(1-s) ii wi(t)t > dt. 
0 0 


Thus, the above is 


[: fo Vrs | fs+1+6+46)6CU-—s—a-y) (=) ca 


S 


We move the s-path left to iis = —1/2, thus crossing the poles at s = 0,5 = —a—y 
and s = —6 — 6. Thus the above is 
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e(1—-a+p— —y +8) (zx) PF? 
B+6 


_bl-a + B— y + 8) (2) —) dt 
a+y 


i rer y(n (S0+8-+8)E(I-e— 7) 


and altogether we have 


» (5 )e (5) er 


(M.N)=1 


xf rhery(o(ta+b+8.e-e-y) 
0 


_U-a+p— =p+8) (2 "Sat B= y+é) (72x) ™ —)a 
B+6 at+y 


All of the above is predicated on m,/n, < 1. The contribution from the terms 
where 1, < m, will be exactly as above but with the quadruple (a, 8, y, 6) replaced 
with (y,6,a, B). In particular, w + y will be replaced by 6 + y prior to summing 
over M and N. This will give another term 


E o(io(aenwen 


(M.N)=1 


x i- rhein ( ca + B+8)f(1-a—-y) 
0 


td=a+8= y +8) (24) , S-a+B- y + 8) (44) ~ —)a 
B+é a+y 
Now we consider what happens when h; = 0 and hy # O. These terms will 


contribute the “complements” to the above two expressions so that we will be in the 
situation described in (1) and so we can execute the sums over M and N as described 
there, replacing the sums over M and N by ratios of zeta functions with small error 
terms. Thus, we obtain 


[rmv0o( +6+8)e(1-a—y)t(l-y +8)f(1-a + B) 
0 C(22-a+fh—-y+5) 
_ ea E(l-a+f—y+de—a+pye—y +8) 
tT (6B +6) €(2-a+ B-—y+t+6) 


me (=) Cdl-a+B-y+s)Cl-a+Bp)eA-yt+ >) . 
tT (a+ y)e(2-a+B-—y+t+6) 


and the complementary term with a + y replaced by 6 + 6 and vice-versa. 
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This is identical with one of the one-swap terms identified by descending as 
previously described. 

There are further semi-diagonal terms. If we do the exact same analysis as 
throughout this entire section but now focusing on the ratio m, /n2 instead of m,/n, 
then the effect will be to switch the roles of y and 6 in the expression above. Then 
we end up with two more terms and a total of four terms. These terms are identical 
with the four terms obtained by the “descent” method described in Sect. 8 of Conrey 
and Keating [2]. 

A question of whether we have over-counted some terms may arise. But the 
“duplicate” terms for which m,/n,; € -@yy and simultaneously m,/n. € Gyn 
with N < Q and N’ < Q contribute an insignificant amount to the total and so may 
be regarded as part of the error term. 


4 The Case of hih2 ~ 0 


Now we consider 


—$ 
mM, “my Pay "n5 A Th, Thy 
» (5 )e (5 ) 2B zB mm (satan : 


hAyho4O 12 SX 
(M.N)=1 ae (* 1).C2) 


In this case we have a bound for hf similar to that for hy: 


NyM 
pee. 
Q ON © 


In particular, we have 
ninoM XxX 
hih| << —— « =. 
[Piha| ON @ 


Now we replace the sums over ™m, 7,1, subject to (*,) and (*2) by their 
averages. As before, we replace m, by uw; and now we replace m2 by uz. We replace 
n, and nz by u,N/M and u.M/N respectively. We then have 


» #3) )e(g)er 


(M\N)=1 


- - a Th Th 
lal ital G ( 4 ) a dua 
hyhas£0 2 MH2SX 2nuj;N  2nu.M 


Now there are four cases to consider according to the four sign choices of h; and ho. 
We make the substitutions 
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_ TMhil 
7 27u,;N 


_ Thal 
: 27u2M 


and move the sums over / and hz to the inside. The condition u;u2 < X implies 
that 


T?|hyho| 2h 

oe ee, u 

4770, Uo.MN ne 
or 

4n?XMNv1 0 

|hyho| < a 

We get 
-—a—B-y—6 
(= y 3 ’ (*) b (5) y+B-1yote-l 
20 nen Q Q 
(M.N)=1 
: // patrol bre! Ss aed ie 
V1 ,U2 <n hg] < MN 
x (Fo + v2) + bo — v2) + W(H01 + vo) + WRU — v2)) dv, dvp. 
Using 
- lo. ) 
oor +02) =f vett + v2)) at 
0 
we see that 
W(v, + v2) + W(v1 — v2) + Y(—v1 + vw) + H(-v1 — v2) 
[o.2) 
= i W(A)(e(tv,) + e(—tv;)) (e(tv2) + e(—tv2)) dt; 
0 
Also 
( 42XMNv1 02 i: 
-o-y,-p-5 _ | ui 
TP = Tf Sista tye + B+ 8)-—€ ds 

hy hy< MND “ 
and 


Co 
/ vitetY! (tv) + e(—tv1)) dv = eee. el es yies y), 
0 
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and similarly for the integral over v2. Incorporating these, we have simplified 
things to 


Zyr" 3 “(5)e )o (5 aero 1yjota— 1 
Qn M=nzo Q 


(M.N)= 


1 4n?XMN \s 


xf Oa ge ea — ds dt. 


2ni 
The above expression is unchanged if (a, y) is interchanged with (6,5). So the 


result of summing terms for which n,/m, < 1 rather than m,/n, < 1 allows for 
summing over M and N as in Sect. 9; we obtain 


sis Waal ai ae Cd-B-y-s+z2o(l-a—é—-s+w) 
(=) (20 oan |, (9) €(2-a—B-y—6—-—2s+z+w) 
x [yorehre 


Ini 


oo ee 


X_)sqztw 
x Gro ds dw dz dt. 
S 


Moving the s-path to the right to co and the z and w paths to the left to —1/4, say 
we obtain 


[vol (F ae g(a — yf B86 — B- ye a8) 
[(C-a-p=y=9) 


er t(lta—-B+y—dtlt+a—ft(lt+y—64) 
Qn (a+ y)o(2+a—-B+y—s) 


ae C(d-a+B-y+dcd—-a+pel-y +8) 
e (6 +8) €2—-a+fp—y+) 


4xne ee y+sc(lte-B-y +d ed +a-s) 
(a+ 6)f(2+a-—B-y+5) 
xB (zy0—" "CU -a+pel-w+B+y—dscdt+y— DY a 
2n (B+y)S2-a+B+y-—5S) 


4X07 


™ 


( 
maar 
( 


with an error term of O(Q~'/*). This expression is exactly what we were hoping for; 
it is identical to the “two-swap” terms found in the descent approach (see Conrey 
and Keating [2], section 9). 
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5 Conclusion 


We have shown how to reproduce the complete conjecture for the shifted fourth 
moment of ¢ by analyzing the mean square of long Dirichlet polynomials whose 
coefficients are convolutions of two smooth arithmetic functions. In the next paper 
we will carry this analysis out for coefficients which are convolutions of an arbitrary 
number of convolutions and use this to reproduce the full conjecture for the 2th 
moment of ¢ for an arbitrary k. 
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A Note on the Theorem of Maynard and Tao 


Tristan Freiberg 


Abstract In a recent and much celebrated breakthrough, Maynard and Tao have 
independently proved a certain approximation to the prime k-tuple conjecture. We 
have subsequently seen numerous interesting applications of the Maynard—Tao sieve 
method, and in this short survey we will discuss some of its consequences for 
patterns in the gaps between consecutive primes. These include a conjecture of 
Erdés and Turan (as pointed out by Granville), and a conjecture of Chowla (first 
proved by Shiu in 1997). More recently it has been realized that the Maynard—Tao 
sieve method does not only produce small gaps between primes, but, as we will also 
discuss, it may be combined with a construction of Erdés—Rankin for producing 
large gaps between consecutive primes, resulting in small, medium and large gaps 
between consecutive primes. 


1 Introduction 


In 1904, as an extension of Dirichlet’s theorem on primes in arithmetic progressions, 
Dickson [6] posed the question: 


Do there exist m linear forms ajn + bj,..., A,n + b, which give m prime numbers for the 
same value of n, as n runs through an infinite succession of positive integers? 


To the surprise and delight of many, Dickson’s question turns out to be within the 
scope of our knowledge of the distribution of primes in progressions. 

The most recent chapter of this story begins 101 years after Dickson posed his 
question, with the groundbreaking work of Goldston—Pintz—Yildirim [16]. They 
provided a conditional! affirmative answer in the case where m = 2. Their work 
was reinvigorated by the outstanding work of Zhang [38], who provided an uncon- 
ditional affirmative answer for m = 2. Most recently, an extraordinary breakthrough 


'Their condition being that the primes have level of distribution greater than 1/2 (see (10) below). 
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of Maynard [21] and Tao” has finally settled the question, unconditionally, for all m. 
We refer the reader to the expository article of Granville [17] for an account of this 
recent history and the ideas involved. 

Thanks to the work of Maynard et al., we now know that for every m, there do 
exist m-tuples of linear forms that infinitely often produce m primes. However, we 
do not know of any particular m-tuple that does this (except, of course, in the case 
m = 1). Nevertheless, the mere existence of such m-tuples has many interesting 
consequences, some of which have been given by Granville [17]. The theme of this 
note is “consecutive primes in tuples”, and as a sample corollary of Maynard’s work, 
we will demonstrate the following result, which was first given by the author, Banks 
and Turnage-Butterbaugh [2, Corollary 3]. Regarding applications in connection 
with consecutive primes, we would be remiss not to mention here the substantial 
work of Baker and Pollack [1], Pollack [27], and Pollack and Thompson [28]. 


Theorem 1. Let p; = 2 < po =3 <--- be the sequence of all primes. Let a and q 
be a coprime pair of integers, and let m = 2 be an integer. For infinitely many n, we 
have 


Ponti = ++: = Patm = amod q and Pntm — Pn+1 < dBm, 


_ ebmt5 


where By, . In fact, there is an absolute constant c such that one can take 
B 3.4m 


m = cme", and one can take By = 600. 


It was Chowla who conjectured that there should be infinitely many pairs of 
consecutive primes Py = Pn+1 = amodq. (See Guy’s book [18, A4].) This 
conjecture, and its extension to “strings” of consecutive, congruent primes of 
arbitrary length m, without the constraint that they lie in a bounded-length interval, 
was first proved in a splendid paper of Shiu [35]. In the special case where 
m = 2, by combining the ideas of Shiu and Goldston—Pintz—Yildirim, the author 
[12, Theorem 3] proved a conditional version of Theorem | (with a weaker bound 
for Bz), and unconditionally [13, Theorem 1.1] that there are infinitely many “Shiu 
strings” Pn = Pati = a mod q with d,/logn arbitrarily small (cf. Sect. 3), where 
dn = Pnt1 — Pn- Quantitative results were also given in [13, 35], but have been 
superseded by Maynard’s result [22, Theorem 3.3]. 

Theorem | is an almost immediate consequence of Maynard’s original work 
in [21]. The basic observation is that if 0 = hy < hy < -++ < hy are integers 
and if, for every integer h in [0, B] \ {41,..., 4%}, vo +2 = 0 mod £,, for some small 
prime ;, then, for all such h,n + h = 0 mod ¢, for every n = vo mod |], €;. Thus, 
thanks to Maynard, we can produce certain configurations of primes in intervals, 
while forcing the other integers in those intervals to be composite. As Granville 
pointed out,’ this idea can be used to establish an old conjecture of Erdés and 


?Maynard [21] writes: “Terence Tao (private communication) has independently proven 
Theorem 1.1 [...] at much the same time.” 


3Private communication. 
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Turan [8] (also see [9] and Guy’s book [18, Al1]) concerning the gaps d,, between 
consecutive primes. The conjecture—now a theorem—is that there are infinitely 
many arbitrarily long increasing sequences d, < dn41 < +++ < dn4m, and likewise 
for decreasing sequences d, > dy41 > +++ > dytm. This result and some variants of 
it are proved in detail in [2]. 


2 Results and Deduction of Theorem 1 


We consider k-tuples 
HO (x) = {gx t+ Aye, = {gin thy,..., gext+ hy} 


of linear forms in Z[x], each having positive leading coefficient. We call #(x) 
admissible if and only if 


V primes p, #{n mod p : []-,(gin + Ai) = 0 mod p} <p. (1) 
Dickson [6] noted that if 
HE (n) = {gin + hii, = {gin th,..., gin t+ hg 


is to contain k primes for infinitely many positive integers n, #(x) must be 
admissible. Throughout, we will tacitly assume that the k linear forms g;x + h; are 
distinct and have positive leading coefficients g;, so that .#(n) contains k distinct 
positive integers for all sufficiently large integers n. If .#(x) is admissible then 
Te, (g;, 4;) = 1, so these assumptions may be written succinctly as 


Z1,---,8% > 0 and Th<icjex(gihj — gjhi) # 0. (2) 
Conjecture I (Prime k-tuple conjecture). For any admissible k-tuple 
FO (x) = {gix + hija 
satisfying (2), there exist infinitely many positive integers n such that #(n) consists 
of k primes. 


The following formulation of the theorem of Maynard and Tao has been given 
by Granville [17, Theorem 6.4]. 


Theorem 2 (Maynard-Tao). Fix any positive integer m. There is a number ky, 
depending only on m, such that the following holds for any integer k = ky». For any 
admissible k-tuple 


FO (x) = {gix + hi} 
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satisfying (2), there exist infinitely many integers n such that #(n) contains at least 
m primes. One can take any ky, satisfying km log km > e&"**. 


Call an m-tuple {g;x + h;}/_, a “Dickson m-tuple” if there are infinitely many 
n for which {gjn + h;}"_, contains m primes. Let (x) = {gix + hj}, be an 
admissible k-tuple with k > k,,, where k,, log kin > "+4 Thus, #7 (x) contains 
() k,,-tuples, all of which are admissible and so, by the Maynard—Tao theorem, 
contain at least one Dickson m-tuple. But each Dickson sub-m-tuple of #(x) is 
contained in exactly G7) sub-k,,-tuples of (x). Hence, 


#{ J (x) C (x): Y (x) is a Dickson m-tuple} > Gites) (3) 


Thus, as a proportion of the total number of m-tuples contained in # (x), the number 
of m-tuples contained in #(x) for which the prime m-tuple conjecture holds is at 
least (k)~. This basic counting argument can be found in Maynard’s paper [21, 
Theorem 1.2], as well as Granville’s article [17, Corollary 1.4]. 

As Granville [17, Corollary 1.4 et seq.] points out, in the case where g; = g for 
each i, all of this still holds if we require that the primes in .#(n) be consecutive 
primes. That is, we have the following version of the Maynard—Tao theorem. 


Theorem 3 (Maynard-Tao). Fix any positive integer m. There is a number kn, 
depending only on m, such that the following holds for any integer k = ky. For any 
admissible k-tuple 


FO (x) = {gix + hiphay 
satisfying (2), there exist infinitely many integers n such that #(n) contains at least 


m consecutive primes. One can take any k,, satisfying ky, log kin > ebnt4. 


Call an m-tuple {gx + hj}7_, a “Dickson* m-tuple” if there are infinitely many 
n for which {gn + h;}7_, contains m consecutive primes. Essentially repeating the 
argument that gave (3), we see that if #(x) = {gx +h;}4_, is an admissible k-tuple 


with k > k,,, where kj, logk, > e8"*4, then 


#{ J (x) C (x): Y (x) is a Dickson* m-tuple} > (a) (ees (4) 


the right-hand side being at least (k/k,,)”. 
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We deduced Theorem 3 in [2], originally with* k, = eo but as Granville? 
pointed out, k,, need not be larger in Theorem 3 than it is in Theorem 2. Indeed, 
Theorem 3 follows at once from Theorem 2 and the following basic proposition. 
Let us denote the set of all primes by 


P= {pp =2<ppo=3<p3=5<---}. 


Proposition 1. Let #/(x) = {gix + hj}_, be an admissible k-tuple satisfying (2), 
and let B be a positive integer. There exists an integer U, depending only on (x) 
and B, and an integer Vo, such that 


HO (Ux + vo) = {gi(Ux + v0) + hiking 
is admissible and, for all sufficiently large integers n, 
PN (Uk Ip(gi(Un + vo) + hi)) = PN {g(Un + vo) + hi}, (5) 
where 
Ip(gi(Un + vo) + hi) = [gi(Un + vo) + hi — B, gi(Un + vo) + hi + B] 


is the closed interval of length 2B, centered at g;(Un + vo) + hi. 


The following proof is given in [2]. The basic idea behind the construction is 
essentially the same as the one involved in producing gaps between primes by 
considering runs of composite integers of the form n! + 2,...,n! +n. 


Proof (Proof of Proposition 1). Let 
P=lkail<a<k,|b| < B} 


be any k x (2B + 1) array of distinct primes £,, that do not divide 


DpH) = givv-ge TI] [] (eitj— ahi + git). 
1Sij<k || <B 
gixth tht (x) 
Bix thy +ht HC (x) 


Of course we want Dg(#) to be nonzero, and this is the reason for the conditions 
Bax tha th €é (x), a = i,j. To see that Da(H) F 0, note that since #(x) is 
admissible, Te, (gi, ;) = 1,80 gahp—gphat gah = galhy +h) —ha(gy) = 0 if and 


4We used the ad hoc construction of the original version of this note: if £; < -+- < €, are primes 
such that k < €, and £, < €7, and if Q = (€,-++&)7! T1,<e, p, then {Ox + ey, is admissible, 
and whenever it produces primes they must be consecutive. To guarantee the existence of the k 
primes £; with £, < 7, we applied the Maynard-Tao theorem to an admissible K-tuple with K 
sufficiently large in terms of k. 


>See  terrytao.wordpress.com/2013/11/22/ — polymath8b-ii-optimising-the-variational-problem- 
and-the-sieve/ #comment-254079. 
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only if gg = gp and h, = h, +h, in which case gpx + hy +h = gaxtha € W(x). 
Let 


U = Ilex lap 


and, using the Chinese remainder theorem, choose an integer vo satisfying the 
following congruence conditions: 


0 mod €,45 if gxthat+tb¢é (x), 
na mod lay if gaxthagt+be H(x), 
(6) 


Vea € F, Sao tha tb= 


where n,, mod £,, is any congruence class mod €,, such that 


Tes (ginay + hi) #0 mod Lap, (7) 


which exists because .#(x) is admissible. For every n and any 1 <a <k, |b| < B, 
we have 


Bartha tbe KH (x) => ga(Un+ vo) + hg + b = 0 mod Lap, 
SO gg(Un+ v9) +ha +b is prime for at most one n. Hence (5) holds for all sufficiently 


large n. 
Let us verify that the k-tuple 


H (Ux + v9) = {gi(Ux + v0) + hi} Ly 

is admissible. If p + U then the solutions n mod p to the congruence 

The (giUn + givo + h;) = 0 mod p (8) 
are in one-to-one correspondence with the solutions to Mh, (gin + h;) = 0 mod p, 
of which there are less than® p as .#(x) is admissible. Now suppose p | U, that is 
Dp = la for some £,, € #. Then there are no solutions to (8). For if ggx +hg+be€ 
H (x), then by (6) and (7), the left-hand side of (8) is nonzero mod £,», and if 
Bax +hat+b ¢ A(x), we have the following. By (6), 


giUn + givo + hj = 0 mod ,5 


®In fact the number of solutions is k, because p + U implies p + 91 +++ gx Theicjax (git — gjhi), 


so we have (Vem (gin + h;) = 0 mod p if and only ifn = —g) th; mod p for some i, and these k 
congruence classes are all distinct mod p. 
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for some i if and only if 
gio + hi = 0 = gavo + ha + b mod Lap, 
which holds only if 
8algivo + hi) = gilSavo + Na + b) mod Lay, 
that is 
Sah — giha — gib = 0 mod fap, 


which is contrary to the assumption that €,, + Dp(#). Oo 


In the special case where #(x) = {gx + hj}* 


j=, and B = max;z;|hj — hil, 
applying Theorem 2 to the “augmented” k-tuple # (Ux + vo) of Proposition 1 gives 


Theorem 3. Indeed, from Proposition 1 we deduce the following. 
Proposition 2. The following statements are equivalent. 


(i) For every admissible k-tuple of the form F(x) = {gx + hae there exist 
infinitely many integers n such that F#(n) contains at least m primes. 

(ii) For every admissible k-tuple of the form W(x) = {gx + h;}‘_,, there exist 
infinitely many integers n such that #/(n) contains at least m consecutive 


primes. 


Remark 1. Let (x) = {gx + h;}*_, be an admissible k-tuple with hy < +++ < hy. 
Theorem 2 actually applies to certain subsets P’ of P that have positive relative 
density and positive level of distribution. Thus, we can prove that if k is large enough 
in terms of m, as well as in terms of the relative density and level of distribution of 
P’, then for infinitely many n, 


IPN [gen + hy, gn + hy]| = |PN H(n)| = [PN H(n)| = m. 
However, we cannot in general exclude the possibility that for all sufficiently large n, 


IPN 4 (n)| = 2|P’N(n)|. That is, we cannot in general conclude that P’ N.#(n) 
infinitely often contains a pair consecutive primes, p, and p,+1. 


In Table 1, & is an integer such that (i) (equivalently (ii)) of Proposition 2 is 
known to hold for the corresponding m, and B,, is a number such that an admissible 
k-tuple 

{x +0,x+hyo,...,.x +h} with 0<hy <---<hy < By 


is known to exist. Thus, 


lim inf(Pr+m — Pati) = Bn. (9) 
noo 
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In the last four rows of Table 1, c denotes some positive constant (possibly a different 
constant in each instance). The results in rows marked [EH] are conditional on the 
Elliott-Halberstam conjecture, which we will now explain. 

The primes are evenly distributed among the possible arithmetic progressions to 
a given modulus g. Very loosely, the primes are said to have level of distribution 0 if 
this equidistribution is already apparent among the primes up to g!/°*¢, if not for all 
q then at least for almost all g. The celebrated Bombieri—Vinogradov theorem states 
that the primes have level of distribution 1/2. It is not known if the primes have 
level of distribution greater than 1/2, but the Elliott-Halberstam conjecture asserts 
that the primes have level of distribution 1. 

More precisely, let 


m(N) 
A(N; q,a) = 5q,a) - ——, 
(N; q,a) = 1(N; q, a) 5) 


be the discrepancy between z(N;q,a), that is, the number of primes p < N 
with p = amod q, and the “expected” number of such primes, viz. 2(N)/¢(q) 
(assuming (q,a) = 1). We say that the primes have level of distribution 6 if, for 
any given A > | and any givene > 0, we have 


> max |A(N;q.a)| K N(log Ny. (10) 

a<Nne- (qa)=1 
(The notation X < Y signifies that |X| < c|Y| for some constant c, which in the case 
of (10) may depend on «€ and A.) Since 2(N) ~ N/logN as N — ow by the prime 


number theorem, this means that on average over moduli g < N°~<, the discrepancy 
A(N; q, a) is small compared to 2(N)/@(q). 


Proof (Proof of Theorem 1). If a and q are a coprime pair of positive integers and 
{x+0,x+hy,...,x +h}, O< hn <-++ < hy 
is an admissible k-tuple, then 
KH (x) = {qx +a, q(x + ho) +.4,...,q(x + hy) + ah 
is also admissible. Choosing U and vp as in Proposition 1, we see that if the interval 
IN,N+ qh], N= q(Un+v)+a 

contains any primes, they all lie in #(Un+ vo), all of whose elements are congruent 
to a mod q. By Theorem 2, .#(Un + vo) contains at least m primes for infinitely 


many n, provided k is sufficiently large in terms of m. Applying the results of Table 1 
completes the proof. Oo 
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Table 1 Bounded length 


: ve : m |\k Bn Reference 
intervals containing primes [EH] I> le 16 Tapy [1 6] 
2 | 3,500,000 70,000,000 Zhang [38] 
2 | 632 4680 Polymath [29] 
2 |105 600 Maynard [21] 
2 |50 246 Polymath [30] 
[EH] |2 |5 12 Maynard [21] 
[EH] |3 | 105 600 Maynard [21] 
[EH] |3 | 54 270 Polymath [30] 
m | [e&"+4] ebnt5 Granville [17] 
m | [cme*"] cme" Maynard [21] 
[EH] | m | [cm?e?”] cme" Maynard [21] 
m | [ce4—28/15))_ | eme4—28/15)™ | Polymath [30] 
[EH] | m | [ce?”] cme" Polymath [30] 


Remark 2. One may deduce the aforementioned conjecture of Erdés and Turan 
concerning increasing and decreasing runs of prime gaps by considering k-tuples 
of the form {x,x + h,...,x +h! or {x,x—h,...,x— ht. 


The major development of Goldston, Pintz and Yildirim (GPY) in [16] paved 
the way for all of the results in Table 1. They established, on’ [EH], that any 
admissible 6-tuple of linear forms infinitely often produces at least two primes. As 
the 6-tuple {x,x + 4,x + 6,x + 10,x + 12,x + 16} is admissible, this meant one 
could conditionally take By = 16 in (9). Briefly, the GPY method runs as follows. 

Let an admissible k-tuple #(x) = {x + hy,...,x + hy} be given. Let (Aq) be 
a sequence of real numbers, so that v(m) = ( at da) is always nonnegative, and 
set vv(n) = v((n + h))---(n + hy)). Letting 1p denote the indicator function of 
the set P of all primes, consider the sum 


k 
S#(Nim= > (dott m= (m1) eta, (11) 


N<n<2N * i=1 


If S;z(N; m) > 0 then we can conclude that |.(n) NP| => m for some n in the range 
N <n < 2N. GPY discovered a system of weights (Aq) for which S.y(N;2) > 0 
for all sufficiently large N, assuming (the unproven conjecture) that 6 > 1/2, and 
provided that k is sufficiently large in terms of 6. 

For reasons to do with level of distribution, (Az) is assumed to be supported on 
the positive integers d < R = N‘°-©/?. A typical choice for Ay (d < R) would be 


Aa = 11(d)(log(R/d))*. 


TIn fact, a level of distribution of 6 = 0.971 suffices. 
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With such a choice, for N < n < 2N, the divisor sum )> d\n Aa approximates the 
generalized von Mangoldt function 


Ax(n) = Day (A) (log(n/d))>, 


which vanishes if n has more than k prime factors. That is, v4 (n) essentially detects 
those n € (N, 2N] for which n + hy,...,1 + hy are all prime. However, with this 
choice the above positivity argument always fails. 

GPY considered, more generally, 


Aa = (d)F (log R/d) 


for a smooth compactly supported function F : [0,0o) > R. They found that by 
choosing F(x) = x**!, the above positivity argument succeeded (with m = 2), 
assuming 9 > 1/2, k sufficiently large in terms of 6 and1 ~ /k. GPY also used 
their weight to prove an important unconditional result, as we will see in the next 
section. 

Zhang [38] was the first to prove unconditionally that lim inf, 0(Pn+1 — Pn) is 
bounded, not by showing that the primes have level of distribution greater than 1/2, 
but by working a substitute of the Bombieri—Vinogradov theorem into GPY’s proof, 
which had the effect of going beyond the level of 1/2. Zhang [38] showed that for an 
admissible k-tuple (x) = {x + h;}‘_,, there exist infinitely many integers n such 
that #(n) contains at least two primes, provided k > 3.5 x 10°, and consequently 
that one could take By = 7 x 10’ in (9). Zhang’s proof was subsequently refined in a 
Polymath project [29] to the point where one could take By = 4680. More precisely, 
it was established that admissible 632-tuples contain at least two primes infinitely 
often, and there is an admissible 632-tuple # (x) with #(0) © [0, 4680]. 

Maynard [21] broke the next major barrier by establishing the existence of 
infinitely many bounded-length intervals containing more than two primes. May- 
nard achieved this chiefly by considering a more flexible system of weights (Ag,_..a) 
than GPY’s, in which 


vse(n) = ( py Nd yoo «) 


dj|n+h; Vi 


may depend on each term of the product (n+h,)---(n+h,) individually. Moreover, 
Maynard’s proof only requires that the primes have level of distribution 6 > 0, 
which means that it applies to subsequences of the primes for which only weaker 
level of distribution results are known hold. For instance, Thorner [36] has shown 
that there are infinitely often bounded gaps between primes in a given Chebotarev 
class, and established a number of interesting applications of this result in itself. 

In [17], Granville uses explicit bounds for 2(N) to show that if 2klogk > 10°, 
then (2k log k) — m(k) > k. Thus, for such k, we can choose k primes larger than 
k, but less than 2klogk, to form an admissible k-tuple. So, for any m > 1, if k is 
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the smallest integer satisfying klogk > e8”*+, one can find an admissible k-tuple 


H (x) = {x + hj}_, such that (0) C [1, 2klogk]. Thus, (9) holds with By = 
e®”+> by the Maynard—Tao theorem. 

Maynard [21] does better than this for large m: there is an absolute constant 
c such that the Maynard-Tao theorem holds with any k > cm*e*” and (9) holds 
with B,, = cm>e*”. Moreover, for specific m one can be explicit. Thus, Maynard 
[21] shows, for instance, that any admissible 105-tuple produces at least two primes 
infinitely often, and there is such a tuple that “fits” in an interval of length 600, 
whence one can take By = 600. A Polymath project [30] has refined and optimized 
aspects of Maynard’s proof and obtained still better bounds (see Table 1). 


3 Small, Medium and Large Gaps Between 
Consecutive Primes 


Recall that d, = pPnr+1 — Pn denotes the nth prime gap. In [16], GPY used their sieve 
weights to establish, unconditionally, the longstanding conjecture that 


dn 
lim inf =0 
n>oo lo gn 


The prime number theorem implies that p, ~ nlogp, ~ nlogn as n — oo, so 
the normalized nth prime gap d,,/ log n is approximately 1 on average. GPY’s result 
then tells us that d,,/ log n is infinitely often arbitrarily small. At the opposite end of 
the spectrum, the result 


. dy, 
lim sup =0o 
noo log n 


goes back to Westzynthius [37]. 

What about limit points of the sequence {d,,/logn : n > 2} in-between these two 
extremes? We denote this set of limit points by L. Perhaps surprisingly, no other 
limit point is known—not even | is known to be in L (the prime number theorem 
implies the existence of a limit point less than or equal to 1). However, as Erdés 
wrote in [10, p. 4]: “It seems certain that the sequence d,,/ log n is everywhere dense 
in (0, co).” 

Indeed, based on the prime number theorem, which implies that 


2N 
Y woy~ fo 5 woo), 


N<n<2N logt 


Cramér proposed that for large N, the sequence (Ip(m) : N < n < 2N) should 
behave like a sequence (X, : N < n < 2N) of independent random variables X,,, 
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where X,, = 1 with probability 1/ log N, and X,, = 0 with probability 1 — 1/logN. 
Thus, Cramér’s statistical model suggests that the gaps between consecutive primes 
should follow an exponential distribution: for any given b > a > 0, we expect that 


b 
#{N <n <2N:d,/logN € (a, b]} ~ nf e ‘dt (N > ov). 


Gallagher [14] showed that this prediction would follow from a certain uniform 
version of the Hardy—Littlewood prime k-tuple conjecture—obviously well beyond 
current ideas. 

Erdés [10] and Ricci [33] were able to show that L has positive Lebesgue 
measure, or more precisely that if A denotes the Lebesgue measure on R, then 
A((0, 2] N L) > 1/8. Goldston and Ledoan [15] have recently shown that, for any 
T > 1, the method used by Erdés yields A([0, T] NL) > (1/@)(1 — 1/T), where 
@ is any overestimate in the sieve upper bound for the number of generalized twin 
primes. (One can take @ = 7/2 by work of Bombieri et al. [4].) Using Zhang’s 
breakthrough [38], Pintz [25] has shown that there is a small (ineffective) positive 
constant 7 such that L D> [0,7]. None of these results demonstrates the existence 
of a finite limit point greater than 1. This was a challenge posed by Erdés, which 
was first answered by Hildebrand and Maier [19]. In fact, they showed that there is 
a positive constant c for which A([0, 7] NL) = c T holds for all sufficiently large T, 
and hence that L contains arbitrarily large limit points. 

In [3, Theorem 1.1], the author, Banks and Maynard show that for any given 
sequence of k = 9 nonnegative real numbers f, < B2 <--- < Bx, we have 


(B)- Bis 1S i<jfSKOLFO. (12) 


The reader may deduce, using properties of Lebesgue measure, that asymptotically 
at least 12.5 % of all nonnegative real numbers belong to L, that is 


AO, T] NL) > 7/8 (To). 


The basic philosophy behind [3] is that the Maynard—Tao sieve method, which 
produces small gaps between primes, can be combined with a construction of Erd6és 
[7] and Rankin [31] for producing unusually large gaps between primes, yielding 
gaps between consecutive primes of any desired intermediate size. We paraphrase 
the following proof outline from [3, Sect. 3]. 

The Erdés—Rankin construction produces long intervals (n, n+ z] containing only 
composite integers. This is accomplished by choosing a set of integers {a, : p < y}, 
one for each prime p < y < z, so that for every integer g € (0, z|, the congruence 
g = a, mod p holds for at least one p < y. By the Chinese remainder theorem 
one can find an integer vo, uniquely determined modulo P(y) = Me <yP, Such that 
vo = —a, mod p for every p < y. Now suppose n = vo mod P(y) andn > y. For 
any g € (0,z] we have g = a, mod p for some p | P(y), andsog+n=a,—a,= 
0 mod p; hence, g + n is composite for each g € (0, z]. We say that the progression 
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Vo mod P(y) sieves out intervals of the form (n,n + z], where n = vo mod P(y) 
and n > y. Noting that log P(y) ~ y by the prime number theorem, the goal is to 
maximize the ratio z/y. 

As we have seen, the Maynard—Tao theorem has established the existence of 
m-tuples of primes in k-tuples of integers the form #(n) = {n+ hy,...,n + hy}, 
whenever® # = {h,,...,/,} is admissible and k is large enough in terms of m. It 
turns out that in the Maynard—Tao theorem one can restrict 7 to lie in an arithmetic 
progression—in fact this simplifies certain estimates in its proof. 

Given a sufficiently large number N and a modulus W = II, <p, P» where Do 
grows slowly with N, one can take n € (N, 2N] with n = vo mod W, provided that 
Vo is an integer for which (vp + h;, W) = 1 for each 7. Instead of the sum in (11), 
one aims to show positivity of 


k 
a (Dots +m) Om) vse. (13) 


N<n<2N i=1 
n=vo mod W 


Choosing the progression vo mod W carefully, one can use it to sieve out all 
integers in intervals of the form (n,n + z| with n = vo mod W except for the 
integers in .#(n). Used in this way, the Maynard—Tao theorem produces consecutive 
m-tuples of primes in intervals of bounded length, more or less as in Proposition 1 
above. 

The results of Banks et al. [3] require a modification of the above ideas to obtain 
consecutive primes in #(n) = {n+ hy,...,n + hy}, n € (N, 2N], with differences 
h, — h; of order log N. To do this, we give a uniform version of the Maynard—Tao 
theorem [3, Theorem 4.3], in which the elements of # are allowed to grow with 
N. We also need to be able to take Do as large as € log N, which is problematic for 
reasons related to level of distribution. 

In the original work of Maynard [21], Do = log; N, hence W is less than a 
power of log, N, though one could take Dp = log, N (W less than a power of log NV) 
without affecting the proof. In our case W may be as large as a small power of 
N, and so our extension of the Maynard—Tao theorem requires a modification of the 
Bombieri—Vinogradov theorem that exploits the fact that the arithmetic progressions 
with which we are concerned have moduli that are all multiples of the smooth integer 
W. Specifically, instead of (10) we really only need a bound of the shape? 


Y> wr? max |A(N; rW,a)| « (14) 
(rW,a)=1) 


r<No-5 
(7,W)=1 


o(W)(log N)4” 


8Let us write #7 = {hy,..., hy} now, instead of (x) = {x+ hy,..., x + hy} as in Sect. 2. 
Actually, we need to restrict the sum to moduli rW that avoid multiples of any “exceptional 
moduli” arising from putative Siegel zeros—a technical complication, which we ignore here for 
the sake of exposition. 
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In [3, Theorem 4.2] we essentially prove that (14) holds for any 6 > 0 and A > 0, 
with 6 = 1/2. (We have W = T1,<p, p with Do = € log N, and we require € to be 
sufficiently small in terms of 6 and A.) This result makes use of standard zero density 
estimates, combined with state of the art bounds due to Chang [5] for character sums 
to smooth moduli. 

The next step is to use a slight modification of the Erd6és—Rankin construction to 
find an arithmetic progression v9 mod W that sieves out the integers in an interval 
(0, z], except for precisely k integers /1,..., hy € (0, z| that constitute our admissible 
k-tuple. As it turns out, we are able to choose # so that for each j and any given 
By = +++ = Bi = 0, we have h; = (6; + € + o(1)) logN. As in the Erdés—Rankin 
construction, we select the integers {ap : p < y}, y < z, in stages according to their 
size. We take 0 < y; < yo < y < z, Say, where y, and y2 are parameters to be chosen 
optimally later. 

First, we put a, = O for primes p € (1, ya]. Next, we use a “greedy sieve” to 
choose the a, optimally for the small primes 2 < p < y,, that is, we successively 
choose a, so that g = a, mod p for the maximum possible number of g € (0, z] 
that have remain “unsifted” thus far. Since we do not know the congruence classes 
a, mod p for the smallest primes, our approach does not work in general for all 
k-tuples 7 = {hy,..., hx}; we find it convenient to select our k-tuple only after 
sieving by primes p < yz. We choose the numbers h; from among the primes 
in (y, z]. (This is why we do not use p = 2 “greedily”—if we had a2 = | then 
only even integers would remain unsifted.) It is clear that each hj # a, mod p for 
all p € ()1,y2] since for those primes we have a, = 0. We can also guarantee 
that h; # a, mod p for the small primes p < y, if we select primes h; in a 
suitable arithmetic progression vo mod P|, where P; = [], <p<y, DP: We choose 
y, = (logy)!/4, so such primes exist by (Page’s version of) the prime number 
theorem for arithmetic progressions. 

The reader may note that the results of Table | suggest that we could only hope 
to show positivity of the sum (13) for large N and m = 1 if k => 50, and yet 
we claim (12) holds for any 6, > --- > B, => 0 with k = 9. Indeed, to obtain 
the numerically superior result, we actually employ an extra inclusion-exclusion 
argument and sieve upper bound, which was considered by contributors to the 
Polymath project [30]. 

We suppose k is a sufficiently large multiple of 9 and 


is a partition of H# = {h,,..., hx} into 9 sets each of size k/9. Instead of the sum 
in (13), we consider 


k 9 
ba (x Ip(n +h) —1— > > 1p(n + h)1p(n + W))vseta. 


N<n<2N i=1 J=l hWe FG 
n=vo mod W h<h’ 
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If this is positive then some n must make a positive contribution, and this happens 
only when there exist two elements h; € # and h; € #,i # j, such that n +h; and 
n + hi are both prime. 

We cannot obtain asymptotic estimates for the new terms corresponding to 
Ip(n + h)1p(n + 1’), but we have 


Ipin+h')< (Canty ha) 


if Ai are real weights with Ai = |, and we can obtain an asymptotic upper bound 
for sums of the form 


S> tea +) Caingn Aa) ve(n), 


N<n<2N 
n=vo mod W 


for suitable (4). The estimates are such that all of this works with a partition of #7 
into no less than 9 sets. We refer the reader to [3, Sect. 4.2] for details. 
We ultimately choose k nonnegative numbers 


Bi,..., Bi, Bo...., B2,..., Bo... Bo 


(each f; repeated k/9 times), and using our modified Erdés—Rankin construction to 
construct an admissible k-tuple # such that #7 = #U---U#% isa partition with 
each #4 of size k/9 and having all of its elements of size (8; + € + o(1)) logN. 

As we mentioned, the purpose of the Erdés—Rankin construction is to produce 
gaps between consecutive primes that are as large as possible. In 1935 Erdés [7] 
showed that 


dy > (logn) (log, n) (log n)~? 
for infinitely many n. In 1938 Rankin [31] extended this to 
d,/F(n) = c— o(1) (15) 
for infinitely many n, where 
F(n) = (logn) (log, n) (log, n) (log; n)~*, 


originally with c = 1/3. Nearly 25 years later, Schonhage [34] showed that (15) 
holds for infinitely many n if c = e” /2, and Rankin [32] subsequently showed this 
holds with c = e”. 

In 1979, Erdés offered his largest ever prize of $10,000 to anyone who could 
show that 


lim sup d,/F(n) = oo. (16) 


n—>Co 
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Until very recently, only two improvements on Rankin’s record had been made. The 
first was due to Maier and Pomerance [20], who showed that in (15) one can take 
c = coe’, where co = 1.312... is the solution of the equation 4/co — e 4/0 = 3, 
The second was due to Pintz [24], who showed that c = 2e” is admissible. 

Finally, by turning the basic idea of Banks et al. [3] around, that is, by incor- 
porating the Maynard—Tao sieve machinery into the Erdés—Rankin construction, 
Maynard [23] has succeeded in establishing (16). This has also been shown using 
different methods independently by Ford et al. [11]. Maynard has even indicated 
a way to replace F(n) in (16) by a function that grows faster by a factor of 
(log, nite), 

The obvious question now is: what about other limit points of the sequence 
d,/F(n), or for that matter of d,/f(n) for any sufficiently slowly growing function 
f? In fact, our method in [3] applies to limit points Ly of {d,/f(n) : n = ny}, for 
any increasing function f such that f(2N) — f(N) < 1 for all sufficiently large N, 
and such that f(N) < (log N)(log; N) (log, N)~! for all large N. It is possible to do 
better—this question has been addressed by Pintz [26]. We expect that one can show 
that Ly contains 12.5 % or more of the nonnegative reals as long as f(N) < F(N). 
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A Prime Analogue of Roth’s Theorem 
in Function Fields 


Yu-Ru Liu and Craig V. Spencer 


Abstract Let F,[‘] denote the polynomial ring over the finite field F,, and let Pp 
denote the subset of F,[] containing all monic irreducible polynomials of degree 
R. For non-zero elements r = (r,/2,73) of F, satisfying 7} + r2 +173 = 0, let 
D(Pr) = D,(PWpr) denote the maximal cardinality of a set Az GC Ag which 
contains no non-trivial solution of ryx; + rmx. + 3x3 = O with x; € Ar (1 < 
i < 3). By applying the polynomial Hardy-Littlewood circle method, we prove that 
D( Pr) Kq | Pai / (og log log log | Pr). 


1 Introduction 


Forn € N = {1,2,---}, let D3([1,n]) denote the maximal cardinality of an 
integer subset of [1, m] containing no non-trivial 3-term arithmetic progressions. In a 
fundamental paper, Roth [20] proved that D3([1,”]) « n/loglogn. His result was 
later improved by Heath-Brown [8], Szemerédi [24], Bourgain [3, 4] and Sanders 
[21, 22]. In 2014, Bloom [2] showed that D3([1,n]) < n(loglogn)*/logn, which 
gives the best upper bound up to date. Szemerédi [23] proved that subsets of the 
natural numbers with positive upper density contain arbitrarily long arithmetic 
progressions, and in 2001, Gowers [5] proved a quantitative version of Szemerédi’s 
theorem. 

One can consider analogous questions with [1,7] replaced by P[1, n], the set of 
positive primes up to n. Let D3(P[1, n]) denote the maximal cardinality of an integer 
subset of P[1,n] containing no non-trivial 3-term arithmetic progression, and let 
gt(n) denote the cardinality of P[1, n]. In [6], Green proved that 
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log log log log log x i 
Ds(PL1, n}) « x(ny{ Blog los log log x(n) \ 
log log log log x(n) 


In [7], Green and Tao proved that subsets of the primes with positive upper density 
contain arbitrarily long arithmetic progressions. 

Let F,[t] denote the ring of polynomials over the finite field F,. For R € 
N = {1,2,...}, let Ae be the subset of F,[¢] containing all monic irreducible 
polynomials of degree R. Let r = (r), 12,73) be non-zero elements of F, satisfying 
ry trotr3 = 0. Let (x1,22,.x3) € F, [AP be a solution of ryx1 + rox. + 3x3 = 0. 
We say that (x1,%2,x3) is a trivial solution if xj = x2, = x3. Otherwise, we 
say that (x,,x2,x3) is a non-trivial solution. Let D(Ar) = D,-(Ar) denote the 
maximal cardinality of a set Ag © Pe for which there is no non-trivial solution of 
rx) + 12x2 +173x3 = 0 with x; € Ar (1 <i < 3), and let | Ap] denote the cardinality 
of Pp. In this paper, we prove the following theorem. 


Theorem 1. For REN, 


| Pr| 
log log | Pal 


D(Pr) Kq log log 


Here the implicit constant depends only on q. 


In the special case that r = (1,—2,1) and gcd(2,q) = 1, the number D(a) 
denotes the maximal cardinality of a set Ag © Ar which contains no non-trivial 
3-term arithmetic progression. In large part, this paper will follow the approach of 
Green. Our improvement over the analogous bound for Z stems from nice properties 
of Bohr sets in F,[¢] and the availability of a stronger bound for Roth’s theorem in 
F, [#] (see [14]) than in Z. It is worth noting that when studying equations of the form 
1X, +++: + 7X5 = O where r; +--- +7; = Oands > 4, in [14], the authors proved 
that 


Pal 
MP2) Xa Gog Pa" 


which provides a strong bound compared to Theorem 1. Also, Lé has proved a 
function field analogue of Green and Tao’s theorem on arithmetic progressions 
of primes (see [11]). While his method provides results about more general 
configurations in the irreducible polynomials of F,[t], the approach of this paper 
produces stronger quantitative bounds on D(a). In addition, several estimates 
of exponential sums in this paper are essential to various additive combinatorial 
problems in function fields, including the results in [12]. 

In 2011, the above mentioned bound of Green was improved by Helfgott and de 
Roton [9] to 


log log log | Pel 


Ali 
|Ar| < | *l Goglog| Pa)? 
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Recently, Naslund [16] showed that for any € > 0, 


_ . 1 l—-e 
rl < Fl (——__) | 
log log | Pr 


In future work, we will show how their methods can be implemented over F,[¢] to 
improve Theorem 1. 


2 Basic Setup 


We start this section by introducing the Fourier analysis of F,[f]. Let K = F,(¢) 
be the field of fractions of F,[f], and let K. = F,((1/t)) be the completion of K 
at oo. We may write each element a € Kyo in the shapea = )°,_, ajt' for some 
r € Zanda; = aj(a) € F, (i < r). If a, ¥ 0, we define orda = rand we write 
(a) for g°*. We adopt the conventions that ord 0 = —oo and (0) = 0. Also, it is 
often convenient to refer to a_; (a) as being the residue of a, an element of F, that 
we denote by res a. For a real number R, we let R denote q®. Hence, for x € F,[¢], 
(x) < N if and only if ord x < N. Furthermore, we let T denote the compact additive 
subgroup of K,, defined by T = {a € Koo: (a) < 1}. Given any Haar measure da 
on Ky, we normalize it in such a manner that Te lda = 1. Thus if Yt is the subset 
of Koo defined by % = {a € Ky:orda < —M}, then the measure of It, mes(J), is 
equal to N71, 

We are now equipped to define the exponential function on F,[f]. Suppose that 
the characteristic of F, is p. Let e(z) denote e*”” and let tr : F, > F, denote 
the familiar trace map. There is a non-trivial additive character e, : F; — C* 
defined for each a € F, by taking e,(a) = e(tr(a)/p). This character induces a 
map e : Ko — C% by defining, for each element a € Kg, the value of e(a) 
to be e,(res a). The orthogonality relation underlying the Fourier analysis of F,[¢], 
established in [10, Lemma 1], takes the shape 


| etiey aa _ 1, whenh = 0, 
T 0, when h € F,[¢] \ {0}. 


For N €N, let .%y denote the subset of F,[¢] containing all monic polynomials 
of degree NV. For b,m € F,[f] with m monic, (b) < (m) < N and (b,m) = 1, define 
a set 


X = Apmn = {n € SYy|mn + bis irreducible} (1) 


~ {n! € Sy+oram|n' is irreducible and n' = b (modm)}. 
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Thus by the prime number theorem in arithmetic progression in F,[¢] [19, Theo- 
rem 4.8], 


|X| = 


N(m) (er . é) 


(N + ordm)d(m) N + ordm 


where ¢(m) = |{n € F,[t]|ordn < ordm and (n,m) = I}. Define a function 
Abn : Ly — C supported on X by setting 


(N+ord m)¢(m) 


= , whenne X, 
N(m) 


Abmn(n) = 
otherwise. 


In the following, we will abuse our notation and view Aj n,y aS a measure on X. 
By (2), we have 


bmn (X) = D> Abmav(n) = 1+ o(1). 


nex 


For functions h;, hy : Ay — C, we define an inner product 


(hy, h2)x — > hy (n)ho(n)Abmn(n). 


n€.Sn 


We will use the wedge symbol to denote the Fourier transforms on both T and .“y. 
More precisely, for f : T > C andh: Wy > C, the functions f* : Ay > C and 
h* : T > Care defined by 


f(n) = [ f(@)e(—n9)d0— and sh (0) = D> h(nje(n8). 


n€.Sn 


Also, we define the convolution of two functions f : T — C and g : T > C to be 


(fF * 8) (o) = i f(0)e(0 — 8) a8. 


For any measure space Y, let B(Y) denote the space of continuous functions on Y 
and define an operator T : B(X) — B(T) by 


T:hteo (hApmn)*. 
A dual operator T* : B(T) > B(X) of T is defined by 


T* :frefly. 
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We have 

(Th, f)r = (h, T*f)x. 
Also, the map T77* : B(T) > B(T) is given by 

TT" of sf EA, ae 


Furthermore, for an operator T and positive numbers a and b, we define 


ITF llo 
[loo = sup =, 
f ILf lla 
where || - ||, denotes the L* norm and f ranges over continuous functions that map 


to C. A main step in proving Theorem | will be deriving a restriction theorem for 
monic irreducible polynomials. Namely, we will prove the following theorem. 


Theorem 2. Suppose that 6 > 2 is a real number. Then there exists a constant 
C(q, 6), depending only on q and 6, such that 
I llo+5 < C(g.8)N-V. 


As an application of Theorem 2, we are able to derive the Hardy-Littlewood 
majorant property for function fields. Namely, we will establish the following 
theorem. 


Theorem 3. Let (dxy)xea, be any sequence of complex numbers with |a,| < 1 for 
allx € Pp. For areal number 6 > 2, we have 


| X> aetx)] <= C71@.8)| SO (x0) 
xXEPR x€ Dp 


8 = 
where C’(q, 5) is a constant depending only on q and 6. 


Note that in the special case when 6 is an even integer, by considering the under- 
lying Diophantine equation, one can show that Theorem 3 holds with C’(q, 6) = 1. 

For a real number 6 > 1, let 5’ denote the unique real number satisfying 1/6 + 
1/5’ = 1. Since 


IZflls = sup (7f.g)= sup (f,7"g) <[fll2 sup |IT*gllo 
Islly=1 lellyy=1 Nellsy=1 


(3) 


1/2 
= Ufa, supe. 7P*s)"? = MflalTT* Is 
gilyr=l 


to prove Theorem 2, it suffices to bound the quantity 


77" |lv+s = sup [If * Ap» wlls. (4) 
Wr 


IIj7=1 
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In this paper, @ will be used to denote a monic irreducible polynomial. For 
a polynomial x € F,[f], we say that x is Q-rough if for all monic irreducible 
polynomials w@ with w|x, we have (w) > QO. For Q €N, define 


No! I] (1- 1/(w)) if n € Ay and mn + b is O-rough, 
n= (w)<O 


sm. wtm 


0, otherwise. 


By a sieve argument, one can show that 


a 2 (n) = 1+ o(1). 


n€.Snx 


Also, we define Ae) = 0 foralln € Ay. Let A = 4/(6 — 2). For a positive 
integer K = [Alog, N] and 1 < Q< K, let 


=I K 
Wo = Ayan (SQ<K) and Wet1 = Asm — Apo: 


Since ears Wi = Avnmn, by the triangle inequality, to bound || T7™ || .s, it suffices 
to consider 


sup |f*vflls (l<j<K+1). 


Wlls7=1 


To obtain the above bound, we will apply the Riesz-Thorin interpolation theorem 
[17, 25] with the following bounds which we will prove in the next two sections: 


If * Wolloo Kgs Q'Mfll and [If # WO lo Kgs NNT! Ilfllo- 


Notation For k € N, let f(k) and g(k) be functions of k. If g(k) is positive and 
there exists a constant c > O such that [f(k)| < cg(k), we write f(k) « g(k). In 
the following, all implicit constants depend at most on g and 6. In Sect.6, while 
6 is fixed, all implicit constant depends at most on q. Throughout, the letter € will 
denote a sufficiently small positive number. We adopt the convention that whenever 
€ appears in a statement, then we are implicitly asserting that for each € > 0, the 
statement holds for sufficiently large values of the main parameter. Note that the 
“value” of € may consequently change from statement. 


3 AnJL?-L? Estimate 


We first state Merten’s theorem for F,[¢]. 
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Lemma 4 ([13, Lemma 2]). For Q € N, we have 


I] @-tm)' «@. 


(w)<O 
Lemma 5. Fora functionf :T— Cand1<Q<K, 
lf * eo ll2 < ON IIflle. 
Also, one has 
lf * ves, < NN“ [flle. 
Proof. Note that for! < Q< kK +1, 
Ilf * Wo lla = If Voll2 < IVolloollf* lle = IVelloo llfll2- 


For 1 < Q < K, by Lemma 4, 


(Q) (Q—-1) 
Il Valloo s IA p.m. lloo + Aon lloo 


=f" J] (Q-1st)y'+ TT -1ftw))" 


«< ON! +(Q-1)N7! « ONT. 
Similarly, 


hue 


IWx+illoo < llAbmwnlloo + llApmnlloo XK 


N d n A 
Hlmy(n FOE Kit ei 
m 


Thus the lemma follows. 


4 AnL!-L© Estimate 


For a function f : T— Cand1 < Q< K+ 1, we have 


If * WG lloo S IVE lloollfllh. 


The goal of this section is to apply the Hardy-Littlewood circle method to establish 
the following proposition. 
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Proposition 6. For 1 < Q < K, we have 
ys “— 
AS iv — a lie < Q 7 


Note that 

Ion — Abmavlloo = lApmavlloo <1. 
Thus by combining Proposition 6 with the triangle inequality, we obtain the 
following lemma. 


Lemma 7. Fora functionf :T—- Cand1<Q<K+1, 
If * Vo lloo K Q'Ilfll- 


Let B = 2A + 12. Note that for all a € T, by Dirichlet’s theorem for F,[¢] [10, 
Lemma 3], there exist a, g € F,[¢] with g monic, (a, g) = 1, (a—a/g) < N8/((g)N) 
and (g) < N /N®. We define the major arcs IN and the minor arcs m as follow: 


M= |) Mag and m=T\M, 


(9) <NFB 
(a,g)=1 
g monic 


where 
Mag = fa ET] (a—a/g) < N*/(g)N. 


In order to prove Proposition 6, we will separate our analysis into major arc 
contributions and minor arc contributions. 


4.1 Major Arc Estimates 


In the following, we consider a function h : .%y — C which satisfies the following 
condition: 


* Condition* Let r,g € F,[¢] with g monic, (r) < (g) and (g) < N®. Let L = 
N — [Blog, N]. Forr’ € Ay with ’ = r (mod g), let 
Y={r+lg|()<D CH. 
Then 


A 


Shin) = & (vag(h) + OCU). 


neY 


where y;,.(/) is a constant depending on h and E(h) is an error term of size o(1). 


A Prime Analogue of Roth’s Theorem in Function Fields 113 


Let 
o(B) =N' S° e(fn). 


n€. Sv 


Lemma 8. Suppose that (B) < N®/(g)N and that r,g € F,|t] with g monic, (r) < 
(g) and (g) < N8. Forh: Sy = C satisfying Condition*, we have 


S>  A(nye(Bn) = (g)'yr.¢(h)o(B) + O((g)'E(h)). 
n€.Snx 
n=r(mod g) 


Proof. Forn € %y with n = r (mod g), we can writen = g(yt’ + J) + r with y 
monic, (y) = N/(g)L and (1) < L. Moreover, for (/) < L, we have 


nb N 1 
(B(gi+1r)) < en (g)° omy <7 
which implies that e(6(g/ + r)) = 1. Thus by applying Condition* with 
r = gyt' +r, 
Y> h@e(Bny= Yo Ye Algor +) +r) e(B(got +) +7) 
ne S = N L L 
ce ons Alt tat 
= os e(Bgyt") > h(gyt" + lg +r) 
(y) =N/(g)L ()<L 


y monic 


A 


= Sra Sy e(Bgyt) + O((g)~1E(h)). 


(y)=N/(g)L 
y monic 


In addition, for (z) < (gt“) = (g)L, we have 


a ee oe 
(Bz) < len = (2) gi t [Blog M1 ~@ 


which implies that e(8z) = 1. Thus 


Y eboh=—z YY elsyt ty) 


ae i ae 
O=H/e)E BE (et) (=Ajlet 


y monic y monic 
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By combining the above estimates, we have 


Y> h(n)e(Bn) = (g)~"yre(h)o(B) + O((g)'E(h)). 


n€.Sn 
n=r(mod g) 


This completes the proof of the lemma. 


Lemma 9. Leth : .%y — C satisfy Condition*. For a,g € F,[t| with g monic, 
(a, g) = 1 and (g) < N°, define 


ough) = > e (=) Vig(h). 


(r)<(g) 
Then for a € Ma,e, 


a 


h(a) = (g)~'oag(h@ (« = “) + O(E(h)). 
Proof, Write a = a/g + B with (B) < N8(g)—'N-!. Then by Lemma 8, 


h(a) = s h(n)e(na) 


ne.Sv 
ra 
= > (“) > h(n)e(Bn) 
(ate) S87 neFy 
n=r(mod g) 


= (g)'0(8) -(“) Yn g(h) + O((g)(g)"BC) 
(r) <(g) 


= (8) '0(B)oa¢(h) + O(E(h)). 


Thus the lemma follows. 


In the following, we will show that the functions Aj, and Ay d<Q<k) 
satisfy Condition*. We first recall a result of Rhin. 


Lemma 10 (Rhin [18, Theorem 4]). Let c,d € F,[f] with c monic and (c,d) = 
1. For D,M € N, we denote by N(c,d;M,D) the number of monic irreducible 
polynomials w of order M satisfying @w = c (mod d) and ord (wt"* — ct") < 
—D + ord @w + ordc. Then 


A 


ui ~ + O((ordd + D+ 1)M'/?), 


N(c,d;M,D) = ——x 
Md¢(d)D 
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Lemma 11. Let r,g € F,[f] with g monic, (r) < (g) and (g) < N®. Then Apgm.n 
satisfies Condition* with 


ote) if (mr + b, mg) = 1, 


Yrg (Abn) = : 
0, otherwise, 


and 
E(a, a = NBtlte (m) 1/246 qy-1/2 


Proof. Recall the definition of X in (1). Let r € Sy with r = r (mod g) and 
Y = {r + Ig|(l) < L} C Ay. Forn =r + 1g € Y, Apmv(n) = 0 if and only if 
mn+bé x. 


(1) Suppose that (mr + b,mg) # 1. We assume that N? < N. Then there exists 
a monic irreducible polynomial w such that w|(mr + b,mg). Writen = r+ 
l’g + Ig for some I € F, [tf]. Then the polynomial 


m+b=mrt+lg+lg)+b= (mr+b)+mgi+l) 
has a factor w. If mn + b € X, then w = mn + Db. Since 
(aw) < (mg) < (m)N® < (m)N = (mn +b), 


we have w 4 mn + b. Thus we have mn + b ¢ X. It follows that 


S> Abmv(n) = 0. 


neY 


Thus the lemma follows in this case. 


(2) Suppose that (mr + b, mg) = 1. Consider 


~m 


Nw = Ny(m, g, L) =#{n=r' +lg| (I) < Land mn + be X}, 


which is equal to the number of monic irreducible polynomials w with 
ord = N+ ordm, w = mr’ + b (mod mg) and (w — (mr’ + b)) < Lig). 
We now apply Lemma 10 with c = mr’ + b, d = mg, M = N + ordm = ordc 
and D = N —L—ordg. Since L = N — [log, N*], we have 


_ ___ Mn)hle) 
(N + ord m)¢(mg)N 


; i O( (ord g + ord m) + (N—L-—ordg) + 1)N"? (m)"?) 
L(mg) 


> PA hn Ky1/2 4) 1/2 
~ (N+ ordm)¢(mg) ci O((ordm + [Blog, N])N’/*(m) ). 
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It follows that 

> Aomn(n) 

ney 
_ e(m)(N + ordm) L(g) as ) 
7 N(m) (a Fordmogap + OMordm + [Blog, NINO m)"") 
= L o(m)(g) No(m)(N + ord m) sis )) 
= Al p(mg) e of 2N(m) (ord m + Blog, N)N’/"(m) 
= 7 (soe o( Ely) 
~ NX (mg) Ni2 ; 


Thus the lemma also follows in this case. 


Lemma 12. Suppose that a,g € F,|t] with g monic, (a, g) = 1 and (g) < N®. For 
o defined as in Lemma 9, one has 


(g)H(g) , (—abm 2 
. . if (mg) =1, 
Oa,¢(Abm.n) = $(8) ( g ) | 
0, otherwise. 


Here, we write m for the multiplicative inverse of m modulo g and 1(-) the Mobius 
function on F,[t]. 


Proof. By Lemma 11, we have 


Oa,g(Apmwn) = s e (=) Vr,@(Abmw) = a y e (<) 


(te) (te) a 
(mr+b,mg)=1 
= _ y 2 (“) . 
OME) hci g 
(mr+b,g)=1 


For z € Z with z > 0, if w*|g and w*t! } g, we write that w*||g. Let 
s= [[ ow, 
walle oie 
and g1 = g/go. If w|m, then w + (mr+b). Thus (mr+b, mg) = (mr +b, go), and 
ar ar 
e{—]= e{—}]. 
x)= & (3) 


(r)<(g) (r)<(g) 
(mr+b,g)=1 (mr+b,go)=1 
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By writing r = ugo + v with (uv) < (g1) and (v) < (go), we have 


3 e() = 3 -(“) 3 (=). 

(r)<(g) 6 (v)<(g0) BS wee) 8! 
(mr+b,g0)=1 (mv+b,g0)=1 

Since 


2 (=) eas ‘ if (21) 1, 
0, ; 


bret 81 otherwise, 
it follows that 


av . 
ais a > e(“) ; if gy — 1, 
Yo e(—) YS ef—) = 4, M<e) 
g 21 (mv+b,g)=1 


no, (u)<(g1) 0, otherwise. 
One has that (g,m) = 1 if and only if g; = 1. When (g,m) = 1, we have mete = 
an Therefore, to prove the lemma, it is enough to show that when (g,m) = 1, 
we have 


av —abm 
a e| — } = n(g)e 
()<(e) ° 2 
(mv+b,g)=1 


Suppose that (gon ao = 1. Letw = mv+b. Then (w—b)m = v (mod g). By checking 


that Ye is a multiplicative function in g, one can verify that 


(w) <(s) 
(w.g)= 


(ww) <tg) 
(w,g)= 
Thus 
y 7 (=) 2 y : (“*) 4 (=) y : (~) 
(u)<(e) 8 (0) <(g) 6 ES wyeig \ & 


(mv+b,g)=1 (w.g)=1 (w,g)=1 


This completes the proof of the lemma. 
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Lemma 13. Lert r, g € F,[f] with g monic, (r) < (g) and (g) < N®. For1<Q<K, 
the function 42 


bmw satisfies Condition* with 


I] (1 _ 1/(w)) Il (1 = 1/(w)), if (mr + b, mg) is O-rough, 
Veo J=4 mse (a) <0 


wtm wtmg 


0, otherwise, 


and 


EOS 5 =N W/(Q2A)+e 4 Ay W/2+e 


where A = 4/(6 — 2) is defined as in Sect. 2. 


Proof. Let r' € Ay with r’ = r (mod g) and Y = {r’ + Ig| (I) < i} Cc Sy. 
Since (b,m) = 1, if w € F,[¢] is a monic irreducible polynomial with w|m, then 
w + (mn + b). Thus it suffices to consider w with w } m. Let @1,..., Or € F,[e] 
denote the monic irreducible polynomials with (w;) < Q and w; + m(1 <i < R). 


Forn =r+ilge€Y, Ae) = Oif and only if w;|(mn+b) forsome 1 <i < R. 


(1) Suppose that (mr + b, mg) is not Q-rough. Then there exists some 7; such that 
w;|(mr+b, mg). Writen = r+l'g+lg for some I’ € F,[#]. Thus the polynomial 
mn +b = (mr +b) +mg(l+/) has a factor w;. Hence, AO tn) = 0 and the 
lemma follows in this case. 

(2) Suppose that (mr + b, mg) is O-rough, Le., @; + (mr +b,mg) (1 <i < R). Let 
X; denote the event that w;|(mn + b) forn € Y, and let P(X;) = |X;|/L be the 
probability of X; occurring. We denote by Xf the complement of X;. Note that 


1 ait L = 
YAP W@== [] G-tw) 1) x== [] G-v@)oy 
neéY N f i=1 N 6 
pee ple 


where 


It remains to estimate Y. 


(2.1) If w;j|g, then mn + b = mr + b ¥ 0 (mod Wj), ie., w; + (mn + b). Thus 
P(X;) = 0. 

(2.2) Suppose that w; + g. Since w; + m, we have (w;,mg) = 1. If L > (ow), 
as / varies with (/) < L, then mn + b = (mr + b) + Img runs through all 
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residue classes modulo w;. Thus we have P(X;) = 1/(m;). On the other 
hand, if L < (w), then either 0 or 1 choices of / will give w;|(mn + b). Thus 


P(X;) = O(L"'). From the above estimates, we have 


P(X;) = 


€j 
(wi) 
where 


—— 0, if wig, 
1, otherwise. 


By the inclusion-exclusion formula, we have 


rede YW +o(ed()) 


1<ij <<is<R j=1 is) s=1 


Note that for any K’ € N, by considering the even terms of the above 
alternating sum, we have 


ye Sey ail aC e roi (' )) 


s=0 1Si) <**<is<R j=1 


= Hl ( (ow “) + o( a ea ae) 
A ae 
+o(¥(2)) 


Similarly, by considering the odd terms of the alternating sum, we have 


2K’-1 


2 ° 2K’—-1 R 
V> ie S I ator a +0(@ - (‘)) 


1<i, <+*<is<R j=1 s=1 
R Ss 
a Ei, 1G, 
“I Os ae (o ee a) 
7 2K’—-1 R 
+ofz ‘ d (‘)) 
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Thus for any J € N, we have 


y= T] (-1@y +0f ea 
we D _=_ I (@i,) +++ (@i,) 


wtmg 


(Z(')) 


s=l1 


To estimate the error terms, note that 


5-1 J R 5-lpJ+1 
ra a (| «ER. 
s=1 


Also, forJ <s<R, 


El tat tag <a ten) <a D, ten) 


Si) <-<is<R j=l i=1 


1 
< —(In@ +c)’. 
s! 


The last inequality follows from Lemma 4 with c some fixed constant. It follows 
that for J > 30nQ +c), 


> > ISS 
s=J <i <"<is<R j=l (@i,) +++ (@i,) 


R 


1 
< )i inet os 
s=J 
(mQt+ey (,, mQte, (nQte)? 
= J! ( J+1 ° J+) +42) ~) 
J 
g MEIN ps4 Ft) 


eIn(eQ) \" 
< (aS) : 


The last inequality follows from Stirling’s formula, namely that J! = 


V2nJ(J/e)’ (1 + O(1/J)). Thus we have 
eax 
v= |] (1-1m)) +0(( FE?) +itR'), 


(m)<Q 
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Since R < a) < NA, by choosing J = N/(2A log, N), we have 


(< In(eQ) 


J 
ji ) < N71/@A) +e and Lo Rit! < NTH2+e. 


Thus 


V= I] (1—1/(w)) + O(N W/QA)+e 4 Xp rte) 
(w)<O 


wtmg 
By Lemma 4, we have 


I] G-V(@)" s (1—1/(w))' K O K log, N. 
(w)<O (w)<@ 


otm 


It follows that 


This completes the proof of the lemma. 
For a polynomial x € F,[t], we say that x is Q-smooth if for all monic irreducible 
polynomials w with w|x, we have (w) < Q. 


Lemma 14. Suppose that a, g € F,[f| with g monic, (a, g) = 1 and (g) < N®. Also, 
suppose that 1 < Q < K. Foro defined as in Lemma 9, one has 


1H) 6 (=a), if (m, g) = land g is Q-smooth, 


P otherwise, 


Q 
Oa,g Chea) = 


where m is the multiplicative inverse of m modulo g. 


Proof. By Lemma 13, we have 


ar 
rue AS2in) = Do eS) reall 


()<(e) 
= J] G-oy' TT o-oo (=). 
(a) <0 (a)<0 (r) <(g) g 


wtm wtmg (mr+b,mg) is O-rough 
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Note that if (m, g) = 1 and g is Q-smooth, then 


I] @-1/t@))* TT (= 1/(@)) = (8)/6(8). 


(a)<O (w)<O 


wtm wtmg 


Thus to prove the lemma, it is enough to show that 


ar L(g)e (=#*) , if (m,g) = land g is Q-smooth, 
E e(Z)aerG 


(r)<(g) 0, otherwise. 


(mr+b,mg) is Q-rough 


Let 


and g3 = g/g2. If m|m, then w + (mr +b). Thus (mr + b, mg) = (mr + b, g), and 


(r)<(e) & (r)<(g) g 
(mr+b,mg) is Q-rough (mr-+b,g2)=1 


Note that (m, g) = 1 and that g is Q-smooth if and only if g; = 1. Then using a 
similar argument as the one in the proof of Lemma 12 (with go replaced by gz and 
g, replaced by g3), we can show that 


3 (“) LU(g)e (=22) , if (m,g) = 1and gis Q-smooth, 
e{—]= 


inet) 0, otherwise. 
(mr+b,g2)=1 


This completes the proof of the lemma. 
We now summarize the major arc contribution to Proposition 6. 


Lemma 15. For 1 < Q < K, we have 
ete Aj may (2) = oy ()| < ot. 
aem 
Proof. Let a € Wt. Then there exists a, g € F,[t] with g monic, (a,g) = 1, (a — 


a/g) < N®/((g)N) and (g) < N/N®. By combining Lemmas 9, 11, 12, 13 and 14, 
if g is Q-smooth, we have 


A x 7 : x F ie 
[AP w(t) — Ay (a)| K NBM (yy) 1/24 fy 1/2 4+N W/QA)+e 4 1/2+¢ «KO 1 
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If g is not Q-smooth, then there exists an irreducible polynomial w with (w) > O 
and w|g. It follows that 6(g) > (wm) = (w) — 1 > Q. Thus we have 


w~ 


[Ajn.n (ot) — A” (a) < AD wal + [af (| 


K 1/6(g) + NBT (my /24+€NHN/2 4 NOU RATE 4 yl/2+€ 


«on. 


This completes the proof of the lemma. 


4.2. Minor Arc Estimates 


We will now turn our attention to obtaining a minor arc estimate for Ap, n,v(@). We 
will obtain the following result. 


Lemma 16. Suppose that (m) < N. One has 


sup |Apn,y(oe)| K NOB? = NA, 


aem 
where A = 4/(6 — 2) and B = 2A + 12 are defined as in Sects. 2 and 3. 


In order to prove this lemma, we need to establish more notation. Whenever a 
sum has a superscript +, which will look like ae the sum will be restricted to 
monic polynomials. Let R € N, and let U be a parameter with 1 < U < R/2. 
Define rt, by 


+ 
r=) Hd). (5) 
d\x 
(d)<0 
Let 
AGS ord@, when y= w! for some monic, irreducible polynomial w and / € N, 


0, otherwise. 


We now will present a sequence of lemmas concerning the weighted exponential 
sum 


y A(y)e(ay): 


(y)<k 
y=b (mod m) 


from these lemmas, we will be able to extract Lemma 16. Due to the underlying 
shape of Dirichlet series in F,|t], we are unable to take an approach similar to that 
in [1]. Instead, we will follow the ideas of [26, Chap. 3]. 
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Lemma 17. Let v(x, y) denote a function on F,[t}°. Then we have 


ae v(l,y)+ \ hu TrU(X, y) = y y > p(d)u(dz, y). 


U<(y)sR T<(x)<R O<(y)<R/ (x) (d) <0 O<(y)<R/(d) (2) <R/(yd) 


Proof. By writing x = dz, we have 


LY LY r@vay= YL vey no 


(d)<t O<(y)<R/(d) {<) <R/ (yd) O<(x)<k O<(y)<R/(x) d\x 
(d)<U 
+ 
> i ee PC) 
(x) <0 O<{y) <R/(x) d|x 
(d)<U 
* (6) 
For (x) < U, we have 
+ 1, whenx=1, 
S> ud) = 
ae 0, otherwise. 
(d)<t 


Thus 


YY vey e@= YO vi). 7) 


(x) <0 O<(y)<R/(x) d|x U<(y)<R 
(d)<U 


The lemma now follows from (5), (6) and (7). 
Let 


+ + 

S@= > AO)e@y), K@)= D> uw (ordy)e(ary), 
(y)<0 (xy) <R 
y=b (mod m) (x)< U 
xy=b (mod m) 


S@)= YY” wwA@e(ary), 


test oh 


xy=b (mod m) 
and 


S4(a@) = y~ T, A(y)e(axy). 
(xy) <R 


(x).Q)>o 
xy=b (mod m) 
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Lemma 18. One has 


+ 
YI AO)e(ay) = Si(a) + S2(a) — $3(w) — Sala). 


(y)<R 
y=b (mod m) 


Proof. Let 


A(y)e(axy), when xy = b(modm), 


u(x, y) = 
0, otherwise. 


We first notice that 


Y~ AG)e(ay) = Si(@) + S7” vIL.y). 


(y)<R U<(y)<R 
y=b (mod m) 


Thus we are left to show that 


J” u(y) + Se(a) = Sx(a) — $3(a. 
O<(y)sR 


Applying Lemma 17, we have 


vant s@M=" YY p@vcazy). 


o<(y)<k (d)<t) <(y) <R/(d) (z)<R/ (yd) 


Since 


ssa) => ys j(d)v(dz,y), 


(a) <8 (y)<O (2) <R/ (yd) 


by combining this with (8), we find that 


vy) + Sale) =O” OY vz.) - Sx(0) 


U<(y)sk (d)<U (y)<R/(d) (2) SR/(yd) 
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(8) 


-)- Ss ae (d) A(y)e(adyz) — $3(a) 


(d)<8 (y)<R/(d) (2) <R/ (yd) 
dyz=b (mod m) 


= p@e(eaw) * Aw) - S5(0) 


(d)<0 (w)<R/(d) vhe 
dw=b (mod m) 
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+ + 
=> YS U@(rdw)e(adw) — $3(e) 
(a)<t(w)<R/(d) 
dw=b (mod m) 


= S2(a) — S3(@). 


The lemma now follows. 
We will now obtain upper bounds for the sums S;(@), S2(a@), S3(a@) and S4(q@). 


Lemma 19. One has 
Si(a) « UU. 


Proof. By applying the triangle inequality and the trivial bound, we have 


Sia)< SS AQ) «bu. 


(y) <0 
y=b (mod m) 


Lemma 20. Suppose that (a—a/g) < (g)~? with (a, g) = 1. Assume that S,R € N 
with S < R. Then for any real number T with T < R/S, we have 


a ai can) « RS(g)~! + SR + (g)(RS + ord g). 
(x)<8 © T<{y)<R/(x) 
Proof. By the triangle inequality, we have 


ye | >a ~~ "eon (9) 


(x)<S 0 T<(y)<R/(x) (x)< W=0 8 (y= 


R—-ordx 


Also, it was proved in [10, Lemma 7] that 


pa ca) = when (|lex|]) < W-!, 


, otherwise. 


Thus we have 


a RS ord x 


DP) 


(x)<8 W=0 ' G)=W 


min(R—ord x, —ord ||ax||—1) 
= 2s ee W 
(x) <S§ w=0 
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« D>" min (&/(2), (lel) 


(x)<$ 
: + 
«YO Ulex + S22 min (&/(x), (etl)“?) 
(x) <(g) W=ord g (x)=W 
= E, + Ep, (10) 
where 
a = = 
BSS Ua aa. Se (R/(x), (llaxll)") 
(x) <(g) W=ord g (x \=W 
We first bound E. For (x) < (g), since (a — a/g) < (g)~?, we have 
ay ¥ (x) 1 
ax — <—. 
Bl “gyre g) 
we deduce that 


(g) 


Since (\jax/ell) 
(etl) = (IF + (@- xl) = (IF + @- 


P= SRM 


Since (a, g) = 1, we have 
ax 
Bs YO (lx) = (IF 
(x) <(g) (x) <(g) {y)<(g) 
ord g—1 
(g) (ord g). (11) 
We are now left to bound E,. Note that 
Ss ~ R-W-1 
+ R + 4 
E, = De > at V 
W=ord g (x)=W_ : W v=0 W=W : 
(llax|])} 1 =R/W (llexl|) "=v 
R-W 
V 


R-W 
+ A 
De Ds 
W=ord g V=0 (x <qW 
(llaxll)<q¥—! 
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By [10, Lemma 7], we deduce that 


We now apply [15, Lemma 11.1] to get 
» | 2 e(axy) < WI((g) +Wl4+vl+ (g)W'0). 
(x)<qW (y)<Vqr! 


Using this bound, we see that 


W=ord g V=0 
Pa fies fae ‘ (12) 
« (R(g)! + RW + WR+ (2)R) 
W=ord g 
< RS(g)! + SR + (g)RS. 
The lemma now follows by combining (9)-(12). 


Lemma 21. Suppose that (a —a/g) < (g)~* with (a, g) = 1 and ordm < U. Then 
one has 


So(a) < U(m)R? + RR*(g)~! + (g)R(R? + ordg). 


Proof. Note that 


Sa) = >” p@\(ordy)e(axy) 
(xy) sR 


(x)<0 
xy=b (mod m) 


{y) 
Line LP eam fo 


t 
(x)s0 (y)<R/ (x) 
xy=b (modm) 


R/(x) 
=Diuw [ (eee) 
(x)<U 


t<(y)<R/(x) 
xy=b (mod m) 


t 
tlogq’ 
By two applications of the triangle inequality, we get 


+ R/ (x) 
Sa@«>y 


Gea*" 


+ dt 
ye e(axy) | Pe 
t<(y)<R/(x) 
xy=b (mod m) 
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Switching the leftmost sum with the integral in the last expression, we obtain 


So(a@) <f- eal | x ae dassft« a | y eaxy) | 


x)<min(U.R/1) t<(y)<R/(x) <0 © t<(y)<R/(x) 
xy=b (mod m) her 1 y=xb (mod m) 


where x is the multiplicative inverse of x modulo m. We now split the sum over y 
into two sums depending on whether or not (y) < (m). Write y = xb + my’ and 
x’ = mx. Then by the triangle inequality, we have 


Sx(a) « [oom i an y~ i eaxy) |“ 


(x)<0 © max(t,(m))<(y) <R/(x) 
(x,m)=1 y=xb (mod m) 


: R d 
< dima | yy ye e(amxy’) le(a) |“ 


(x)<O © max(1/(m),1)<(y') <R/ (mx) 
(x,m)=1 


«K dima f y~ | y~ e(ax’y’) a 


(x/)<O(m) © max(t/(m),1)<(y’) sR/() 


Since ordm < U < R, by Lemma 20, we deduce that 


R 
Soa) « Ulm)R-+ i (iu + ordm)(g)! + O(myR 
1 


+(g)((U + ordm)R + ord g)) “ 
< U(m)R? + RR2(g)-! + (2)R(R? + ord g). 


This completes the proof of the lemma. 
Lemma 22. Suppose that (a —a/g) < (g)~* with (a, g) = 1 and ordm < U. Then 
one has 


S3(a) < RR*(g)~! + U?(m)R? + (g)R(R? + ord g). 


Proof, For any (x) < U?, we have 


So pwA(vr) K J” AQ) = ord « R. 


X=uUv 


(u),(v) <0 


v|x 
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Write y = xb+my’ and x’ = mx, where x is the multiplicative inverse of x modulo m. 
Then from the above inequality, we deduce that 


+ + 
S@= Yo SO pWA()e(@xy) 
(xy) <R xX=uv | 
(eee Wte)s0 
xy=b (mod m) 


« ye ~~ Hw Co) | x e(axy) 


OSM est igen 
«RY. | x e(axy)| 
(si ()<i/() 


pL 1 y=xb(modm) 


=R | y~ e(amay’)| 


(x) <0? (y')<R/(mx) 
(x,m)=1 


<R = | > e(ax'y’)). 


(x/)<0?(m) (y')<R/(x’) 


Since ordm < U < R, by Lemma 20, we obtain that 
S3(a) & R(R(2U + ordm)(g)~! + U?(m)R + (g)((2U + ordm)R + ord g)) 
< RR?(g)! + 0? (m)R? + (g)R(R? + ord g). 
This completes the proof of the lemma. 


Lemma 23. Suppose that (a —a/g) < (g)~* with (a, g) = 1 and ordm < U. Then 
one has 


S4(a) <K RR! (m)'/2(g\—"/? + RRO? (mm) O12 + RY/?R9/? (my "/? (gy 1/2, 
Proof. By writing x = y\z, z = rs and y; = uv, we have 
+ 2 + s+ 
» Wey ey (= 


(J=V WwW=V (J=V  yle (n)<V )=V yale 
x=X (mod m) YI 


«KIV yt = VO wn)! « 94. 
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Note that 


Sa) = So” AO)e(axy) 
(xy) <R 


(x),9)>0 
xy=b (mod m) 


+ 
~ 
Ty S A(y)e(axy). 
U<VWeR-U RS (j=? (y)=W 
V+WSR X= (mod m) y=¥ (mod m) y=y (mod m) 


Applying the Cauchy-Schwarz inequality and (13), we obtain that 


sa« > > (x re) 


U<V,W<R-U XY (x)=V 
V+WSR = 3}=b (modm) x=X (mod m) 


«(XP | XP Avveten|)” 


(W=V (y)=W 
x=Xx(modm) y=y (mod m) 


z + + a 
< > x ee aa > | y. A(y)e(axy)| ) 
U<V,W<R-U XY vy=V \=Ww 
V+WSR  X}=b (modm) a: a m) = ae m) 
(14) 
One has 
+ + 2 
ee) 
()=V (y)=W 
x=x(modm) y=y(modm) 
an a 
=») YAGI) AOr)e(@x(1 — y2)) 
W=) (=n) =0 
x=Xx (mod m) y1 =y2=y (mod m) 
+ + 
= Y>  AQ)AG+h) YI e(axh). (15) 
(y)=W (h)<W (x)=V 
y=y (mod m) h=0 (mod m) x=X (mod m) 


For (%,m) = (¥,m) = 1, V + W < Rand (m) < min(V, W), since | A(z)| < ordz, 
by writing h = mh’, x = X + mx’ and h"” = m7h’, we have 
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YY AW)AV +A) YS elaxh) 


W)=W  (n)<W (=? 
y=Y (mod m) h=0 (mod m) x=X (mod m) 
<w yt ye | 3 e(axh) 
(y)=W (n)<W (d= 
y=y(modm) h=0(modm) x= eae (16) 


2 pl 


= BF yeah ay 


(h’)<W/{m)  (x’)=V/(m) 


ww + ae 
< “(m) s | > e(axh ) . 


(nh!) <Wm) (x/)=V/(m) 


When V + W < Rand U < min(V, W), it follows from [15, Lemma 11.1] that 


>. | y~ e(ax'h”)| < pS | > e(ax’h") 


(a) <Wim)  (x’)=V/(m) (hl) <Wim)  (x’)<qV/(m) 


A 


<K WV((g) + W lm)! +07! (m)+(g)(WV)') 


«& Rig)! + Rim) U0! + (g). 


Upon combining (14)-(17), we have 


Sa(a) K me > V2 3/2 Wl? Wm) "/? (Rig)! 
U<V,W<R-—U x,y 
Vt+WSR = %}=b (modm) 


+ R(m)0~! + (g)) 1? 


K do RAR? (m)/?(R(g)! + Rim) OA! + (g))1 
U<V,W<R-—U 
V+W<R 


<K RR? (m)'/2(g\— 1/2 4 RR? (m) 07"? + RY? R93? (my "2 (g) 1/2, 


Lemma 24. Suppose that (m) < RR, (g) < R(m) and (a — a/g) < (g)~? with 
(a, g) = 1. Then one has 


So AGe(ay) « R/S (m)RE + (g)R? + RR?!) "/2(g) “1? 


(y)sk 
y=b (mod m) 


+R!/2R°/?(m) 1/2( 9) 1/2 
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Proof. We deduce from Lemmas 18, 19, 21, 22 and 23 that when ordm < U, we 
have 


So AG)e(ay) « Elm)R? + (g)R(R + ond g) + RR?/2(m)!/2(g)—V? 


(y)sR 
y=b (mod m) 


ae RR®!/?(m) Uo '/2 ae R'/2R9/? (m) 1/2( 9) 1/2. 


The lemma now follows by setting 0 = R2/5R. 
We will now derive Lemma 16 from Lemma 24. 


Proof (of Lemma 16). Note that 


(N + ordm)¢(m) + 
de mw (Ge) = ~ x e(an) 
ne) (n)=N 
mn-+b irred 
+ 
= om) ¥ A(mn + b)e(an) 
N(m) ‘nat 
N+ordm + 1 + 1 
oS ot ah) 
(w)=(N(m))'/ (w)=(M(m))'/3 
@ itred @ irred 
+ A 
= (mm) Pz A(mn + b)e(an) + O(N7"? my"). 
N(m) at 


By writing x = mn + b, we have 


FO (rr A y~ A(x)e(ax/m) + O(N~"/?(m)"/). 
- os Peake ‘5 


By the triangle inequality, we deduce that 


A + + 
Ap mn () <N! | > Atse(ax/m) + | > A(s)e(ax/m)|) 
(x) <\(m) (x)<q7!N(m) 
x=b (modm) x=b (mod m) 
+ NV? (my'/?, 
(18) 
Let w € m. By Dirichlet’s approximation theorem, there exist a, g € F,[¢] with g 
monic, (g) < N(m)/N®, (a,g) = 1 and (a/m—a/g) < N8/((mg)N) < (g)~2. Let 
d = (g,m). Then 
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dk NB NB 
a-— < ~< as 
g/d! (g)N ~ (g/d)N 


Since a € m, we must have (g/d) > N®, which implies that (g) > N®(d) > N®. By 
Lemma 24 and (18), we have 


Ap mn (2) 

<K N7! (N*/5 (m)9/5.N4 +4 (g)N° + NN?!?(m)3/2(g\—'/2 4. N/21N9/? (m) (2) 1/2) 
+N712 (my 1/2 

< N7"/5N29/5 4 NAB 4 NO-BI2 4 N-V2NV2 & NOB? = NAA, 


This completes the proof of the lemma. 
nw 
We will next prove a minor arc estimate for Val yi). 


Lemma 25. Let 1 < Q < K and (m) < N. Suppose that (a — a/g) < (g)~? with 
(a, g) = 1. Then one has 


Jaserv(@)] < log, N(N(g)~! + N-Ng)(W? + ord g) + A-VOMN). 


Proof. Let {@\,..., Wr} denote the set of monic, irreducible polynomials w with 
(mw) < Qand a | m. By the inclusion-exclusion principle, we have 


R 
AjOrv(a) = > Ape v(ne(an) = NT] = 1/(a)) "h(a, (19) 


n€. Sn i=1 


where 


R 
na)=S-1I a e(a( TP"), (20) 
=" PsisesisSR (y)=N(m) (ays) 


Wi, Wis y=b (mod m) 


By Lemma 4, we have 
R 
[]@-1/(@))~ « @ «log, N. (21) 


Let J = N/(2A log, N). 1f0 <5 < J, since (wi) < O < N4, we have 


AY 
TI\=) < NAN/(2Alog,N) — yyN/(2log,N) fy /2 
i;) S 
j=l 
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Therefore, by writing y = xb+-my’, where x is the multiplicative inverse of x modulo 
m, it follow from Lemma 20 that 


3 (-1) 3 > e(a( T=?) 
O0<s<J ISi1<<isSR (y)=N(m)/(wy +7) oe 


Wi, Wis y=b (mod m) 


aD ES (Be E'| Ee ce 


(x) sW1/2 © (y)=N(m) / (x) (x) <N1/2 © (y')=N/(x) 
(xm)=1 xy=b(modm) 


(22) 


< NN(g)~! + N'/?2N + (g)(N? + ordg). 


For s > J, we have 


s 7 Wi": Diy — 
yor » yr e(a( MR *)) 
J<s<R 1<i) <++<i,<R (y)=M(m) / (wp) 


Wi, Wis y=b (mod m) 


KD DE Mm)/(mi, ++ o,) 


J<s<R 1S) <*<is<R 


« Nm) S> (s!)“"( Pee + (wp)7!)’ 


J<s<R 


(23) 


« N(m) S> (s!)~'(C; log, log, N)', 


J<s<R 


where the last inequality follows from Lemma 4. By Stirling’s formula, we have 
sl= V25(5) (1+ o(+)). Thus for s > J = N/2A log, N, we have 


Cyelog, log, N\* 
Y= (s)'(CiloglogNy’ «K So sre SSE) 


S 
J<s<R J<s<R 


2 3 — eee) 
N 
J<s<R 


log, Ny 1/2 s(-1+0(1) 
7) ae 


J<s<R 


log, Ny 1/2 
“< ( O84 ) / N(-1+0(1))W/2A 08, N) 


(sen) Ma ventocy < N7MGA), (24) 
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By combining (19)-(24), we deduce that 
A" (@)| « NW! log, N(NN(g)7! + NN + (g)(N? + ord g) + NI" (m)) 
« log, N(N(g) 1 + N-!(g)(N? + ord g) + N-VGAN), 
Lemma 26. Let 1 < Q < K and (m) < N. One has 


sup A(2" (@)| «N78 log,N< nN“, 


aem 


Proof. Let a € m. By Dirichlet’s approximation theorem, there exist a, g € F,[¢] 
with g monic, (g) < N/N®, (a,g) = 1 and (a — a/g) < N®/((g)N) < (g)~2. Since 
a € m, we have (g) > N®. By Lemma 25, 


JA.” (a)| K log, N(N(g)~! + N-!(g)(N? + ord g) + N-/C%N) 
« N** log, N«KN™. 


We now summarize the minor arc contribution in Proposition 6. 


Lemma 27. For 1 < Q < K, we have 
sup [Ap may (02) - 112" (a)| «K<N4 «OT. 
Proof. The lemma follows by combining Lemmas 16 and 26 and noting that 
NO eo, 


Note that by combining Lemmas 15 and 27, we obtain Proposition 6. 


5 Proofs of Theorems 2 and 3 


We will first prove Theorem 2. 
Proof (of Theorem 2). By Lemmas 5 and 7, for 1 < Q < K, we have 


If * Wolloo K O'Ilfl; and [If * WO lla K ONT! [If le. 


By the Riesz-Thorin interpolation theorem [17, 25], we interpolate between these 
two bounds to find that for 6 > 2, we have 


Ife vols «x OTPEPEN 2 IFly. 
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Similarly, since 
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If * Vesilloo K K+" Ulli KNAUF and [fe vegille K NNT [lfllo. 


for 5 > 2, we have 
If * Veale < NAC142/8)+2/3 7-2/8 1 eH, 
Upon recalling that A = 4/(6 — 2), we have 
Ih * Ve yills < NOON fly. 


By the triangle inequality, 


K+1 


n7—-2/5 
If AN alls < So Ute Vo lls KN? Uiflls. 
Q=1 


Therefore, by (3) and (4), we have 


1/2 A 
Tllo+s< sup If *ARnalls? «K AV, 


Iiflls7=1 


This completes the proof of the theorem. 


We will now deduce Theorem 3 from Theorem 2. 


Proof (of Theorem 3). When 6 = 2, the theorem follows from Parseval’s inequality. 
Hence, we assume that 5 > 2. Let (dy)xe@, be a sequence of complex numbers with 


|a,| < 1 forx € Pr. Let 


f@) = i ifx € Pr, 


0, otherwise. 


Then, by setting Apm.n = Ao,1,R, it follows from Theorem 2 that 


R a Ge 1/8 1/8 gr\'? 
ps acs e(aa)| da) = |fls <#if = "(lass 
T 


xE PR xE PR 


KROME, 


Thus 


| > are(x0)} = (/ > ave(es)|'da) < RIVap, 


xE Pr xE PR 


(25) 
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Also, for (8) < g7!R7! and x € Pe, we have (Bx) < q7', implying that 


1/3 
| ») e(x6)|| a (/ z e(ax)| de) 
> ( [, 7 x e(B2)| ab) (26) 
B)<q 


1/8 
> (/ Rap > REVERT, 
(B)<q7!RO! 


The theorem now follows by combining (25) and (26). 


6 Proof of Theorem 1 


To prove Theorem |, we will employ the W-trick (see [7] for a discussion of the 
method). Namely, we will pass to an arithmetic progression with common difference 
equal to a product of small irreducible polynomials and this will allow us to avoid 
some obstacles modulo small irreducible polynomials. It is worth noting that if 
one is able to avoid using the W-trick, the resulting bound in Theorem | could 
be improved to Dy( Yr) K |Prl/ log, | Pal. 


Lemma 28. Let r),72,73 € Fy with r; + r2 +73 = 0. Suppose that Ary © Pr and 
that there is no non-trivial solution to ryX, + rox. + 73x3 = O with x1, x2,x3 © Ap. 
Suppose also that |Ar| > nR/R for some n € R with n > 0. Let 


W= [tog, (“E*)] and = || w. 


(a)<W 


SetN = R/(m). Then for N sufficiently large, there exists & © Sy such that 


¢ There is no non-trivial solution to 71x, + rox. + 73x3 = 0 with x1, x2,x%3 € &, 
° There exists some b € F,[t] with (b,m) = 1 and Apmn(P) = Nn. 


Proof. Let 14, denote the characteristic function of the set Ar. We have 


So SS Lag) = nk/R. 


(b)<(m) — x€ SR 
(b,m)=1 x=b (mod m) 


By the pigeonhole principle, there exists b € F,[¢] with (b) < (m) and (b,m) = 1 
such that 
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A 


nR 
Qs. 
x€SR p(m)R 
x=b (mod m) 


Let S = {n € Yy|mn+ b € Ag}. Thus 


(N + ordm)d(m) 


Abmn(Z@) = Nim) 


S > Lag(mn + b) > n. 


n€.Sn 


Since r} + 7 +73 = 0 and there is no non-trivial solution to rjx1 + x2 +73x3 = 0 
with x1, x2,x3 € Ar, it follows that there is no non-trivial solutions to rjx, + rex. + 
3x3 = O with x), x2, x3 € &. This completes the proof of the lemma. 


In order to apply Lemma 28 with the earlier work in this paper, we need to bound 
(m) in terms of N. Note that 


Ww 
ordm = S ordw = S- K(K/K + O(K'/?/K)) = q(q- DoW o(w!/?). 


(w)<W — 


log, R 
4 


Since W = [ log, ( )]. for R sufficiently large in terms of g, we have 


log R log R 
ordm € | Eg Eq | 


from which we derive that 
N=R/(m)€ [RROV1 RR-VG19) 


In addition, we have (m) < R'/'° < N and W < log, log, N. 

For a set & C Wy anda monic irreducible polynomial w of degree N, we embed 
& into F,[t]/@F,[t] via the bijection x — x(moda@). Also, we define Fourier 
analysis for F,[‘]/@F,[#]: if f.g : F,[|/wF,[t] ~ C andr e€ F,[f]/wF,[t], we 
write 


Fo) = Y> f@ecx/w) and f* g(r) = D> fede(r—2. 
(x)<(w) (x)<(w) 


We define functions k,A : F,[t]//wF,[t] > C by 


(x) 1, if there exists y € & such that x = y(modw), 
K(x) = 
0, otherwise, 
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and A(x) = Apm.w(y), where y is the unique element of Wy with x = y(modw). 
We also define a function a : F,[t]/wF,[t] > C by a(x) = x(x)AQ). First, we 
estimate the function A. 

In what follows, we will fix 6 = 5/2. Thus all implicit constants below depend 
at most on q. 


Lemma 29. We have 


sup |A(z)| (log, NY. 
z40 (mod w) 


Proof, Note that A(z) = Ap. (2/@). For z/™@ € m, by Lemma 16, we have 
\(2) KN“ «K (log, NY. 


Thus we are left to prove the lemma for the case that z/m € Wtag C Wit. By 
Lemmas 9, 11 and 12, we have 


U8) 9( =abit) o( 2. — a) ow, if (g,m) = 1, 


A) = yc ee ie 8 ee 
o( S| : otherwise. 
Because 
NBtlt+e (m) 1/2+¢€ 


-1 
Wi < (log, NN)" 


it is enough to show that when (g,m) = 1, we have 
= zZ a = 
$(g)"'0(= - =) « dog, Ny. 
wo &§ 
For (g) = 1, since z € 0(modw), 


og) 'o( = — 2) = aG/m) = 8" YT elexr/m) = 0. 


xESy 
For (g) > 1, note that |o(~)| < 1 for alla € T. When (g) > 1 and (g,m) = 1, by 
the definition of m, there exists a monic irreducible polynomial w’ with w’|g and 
(w’') > W. Thus 
o(s) 1 < o(o')| KW « (log, N). 


This completes the proof of the lemma. 
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We now prove a discrete version of the majorant property with 6 = 5/2. Note 
that the proof below can be adapted to give a discrete majorant property for any 
6 > 2. 


Lemma 30. There exists an absolute constant C" (q) such that 


> la@p? < c’@). 
(z)<(w) 


Proof, For (w) = N > 1,x € -y and (8) < 1, we have e(22t) a e()e(“4). 
Thus for all (a) < N, by writing aw = z+ 6 with z € Sy_; and 6 € T, we have 


Y OP = DI | YK eCaeeter/w)|” 


(z)<(w) {)<(w) x€.Ay (27) 
=|. ,| > K@A)e(ax/ o)| da. 
)<N x€.SN 


By writing a = wy, we deduce that 


I, a Y> K@)A@el(ax/w)|"” da = vf | D> K@)Aomv ae yx)” dy. 


xE.SN xE.SN 
(28) 
By Theorem 2, 
2/5 
( [|X eorsnvenetra)] ‘ay) 
xE.SN 
= ||Tk|lsy2 < NP lle |2 (29) 
* ee 
= 8-7 OP Aamv) << 8-87. 
xE.SN 


By combining (27)-(29), we find that 


Yd jar? «1. 
(z)<(w) 


This completes the proof of the lemma. 


Let ¢ be a real parameter satisfying 0 < ¢ < 1 and define 


# = #5) = {c ¢ F.d/ oF, [A | |a(@)| = s}. 
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Let k = |@| and write & = {z,,..., 2}. We now are able to define a Bohr set 


B=BE) = {x € F,[d/oF, Id | (|= 


)<qt sis}. 
Define a function 6 : F,[¢]/wF,[t] > C by 


|A\"', ifxeS, 


0, otherwise. 


po | 


We define a function a, : F,[4]/@F,[t] > C by ai(x) = (a * B * B)(X). 


Lemma 31. There exists a positive constant C2(q) such that whenever k < 
log, log, N, we have ||ai|loo < C2x(q)N!. 


Proof. From the definition of a; and Lemma 29, we have 


ay(x) = (a * B * B)(x) < (A*B* B)@)=N! D> AGQ)BO)e(-2y/w) 


(y)<(w) 
<N TAO BOP +N" D> AG)BO)e(-2y/w) 
(y)<(w) 
y#0 
«KN '+N' sup [AO >> [BO)/? 
y40 (mod w) (y)<(w) 


« NT! + (log, N)7"|BI!. 


Recall that ¥ = {z,...,z}. Consider the mapping [ : F,[‘/mF,[‘] > T* 
defined by 


P(x) = ([laxzi/@),-.-. [lxze/ wl). 
Let 
G = {(ay,...,0%) ET’ | (aj) <q | <i<h}. 
By the pigeonhole principle, there exists an element (v),..., vg) € Fi where 


KH = {x(modw) | (x) — (v1,..., vg) € G} 


contains at least Nq-* elements. Let y € #. Then for any y' € #, we have 
I'(y—y’) € Y. Hence, |B] > Nq~*, implying that 


Jai(x)| < N7! + (log, N)“'No'gk « NTT. 
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We will now prove upper and lower bounds for the sum 
NT SO a(rizda(ro2dai (732), 
(2)<(@) 
and we will then deduce Theorem | by comparing these upper and lower bounds. 
Lemma 32. Suppose that there is no non-trivial solution to rx, + r2x2 + 73x3 = 0 


with x; € & (1 <i < 3). Then 


N7' S$ ai(riz)ai (722) (r3z) K NN? + N71G 1, 
(z)<(w) 


Proof. Since there is no non-trivial solution to rx; + ™mx2 + 3x3 = O with 
x; € A (1 <i < 3), we have 


NT SS a(rida(r2darsz) = DI YE alaralan)a(—nry'x1 — rary 1x9) 
{z)<(w) (x1) <(@) (x2) <(@) 


= > a(x)? < > omy)? 
(x)<(a) ye. Sy 
(N + ordm)?¢(m)? 


« - < N°-N7. 
N2 (m) 2 


Since a, = ap?, it follows that 


No! » G1 (112) (12z) a1 (732) 
(z)<(w) 


7 > (a (radar (722) (032) = a(rz)a(r22)a(r32)) 
(2)<(m) 
+ O(N7N~’) = 
= 8 alrgiatrasva(rse) (Br)? Blea) Bers) — 1) 
(z)<(w) 
+ O(N?N~”). 


Note that when z € & andr € Fy, since (||rzx/@||) < q7! for all x € Z, we have 


BGd) = |Al! Y> e(rx/m) = 1. 


xEB 
Thus 


Yar 2)a(r22)4(r32) (B12) B22) Blryz)? — 1) = 0. (31) 


262 
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Note that for all z (mod w), 
|B (riz)? B(r22)? B(rsz)? — 1 < 2. 


By combining Hélder’s inequality with Lemma 30, we have 


Do A(rrz\a(r22)a(r2) (B12)? (722) Braz)? — 1) 


yee 
: (32) 
« sup |a(z)|/? So lah? «5? 
ei ()<(w) 


The lemma now follows by combining (30)—(32). 


Lemma 33. Suppose that k < log, log, N. Then there exists a positive constant 
Cs = Cs(q) such that 


N77! > a1 (71 Z)a) (72Z) a4 (732) > tN 1g Os!", 
(z)<(w) 


Proof. Let 


we {x e F,[]/wF, [el | ai(x) > sat 


By Lemma 31, there exists a constant C7 = C2(q) > 1 such that ||aj|loo < CN-!. 
Thus by Lemma 28, 


C i 
Ifo + Wl = Yh arte) 


(x)<(w) 

= >) @* Bx B)() 
(x)<(w) 

= >> 80) >> Be-y) DS a@-2 
(y)<(a) (z)<(w) (x)<(w) 

=n >. BO) >> Be-y)=n. 
(y)<(w) (z)<(w) 


Hence, we have 


|.o’| > nN/(2Co — n) > CanN, 
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where C3 = 1/(2C2) € (0,1). Let § denote the number of non-trivial solutions to 
rx, + Mx. + 3x3 = 0 with x; € o& (1 < i < 3). Then one has 
AL. [ 7 a C3 3g 
8 y> Alri rain rd = ee (33) 
2)<(w 


Let M € N. By [14, Theorem 1], there exists a positive constant Cy = C4(q) such 
that if M > C4/n, then any subset of Sy of density at least C37/2 contains a non- 
trivial solution to r)x, + 2x2 + 73xX2 = 0. Furthermore, since r; € F, (1 < i < 3), 
the same is true for any space isomorphic to Sjy as a vector space over F,. Now, let 
M <N. There are N(N — 1) choices of (u,v) where u € Ay and 0 < (v) < WN. 
Consider arithmetic progressions of the form W,., = {u + vl] (l) < M} c 
F,[f]/@F, [1]. Let Y = {(u, v) | [Wav @| > C3nM/2}. Note that |W,»0.a//| <M 
for all wu and v. Upon noting that every element x € ./’ lies inside exactly (N - 1)M 
sets W,,,, we have 


|Z |M + (N(N — 1) —|%|)CanM/2 > (N — 1)M |x| > CanN(W — 1). 
It follows that 
|%| = CanN(N — 1)/(2 — Can) = CanN(N — 1)/2. 


Thus there are at least C3 nN (N —1)/2 sets W,,, for which ./ M W,,, has density at 
least C3 /2. Provided that C4/n < M < N, each set W,,y with (u, v) € Y contains 
a non-trivial solution to rx; +12x%2+73x3 = 0. Note that for any non-trivial solution 
1X1 + x2 + 3x3 = 0 with x; € &’ (1 < i < 3), there are at most WM? choices of 
(u, v) so that (x1,x2,x3) € W2 ,. Therefore, provided that [C4/n] < N, by setting 
M = [C4/n], we have 


5> Ean = 1) 


R72 —2C4/n 
2. FAP > nNq : (34) 


The lemma follows by combining (33) and (34) and setting Cs = 2C4. 
We are now in a position to prove Theorem 1. 


Proof (of Theorem 1). Let n, Ar, &, N and R be defined as in Lemma 28, where R 
is sufficiently large in terms of g. Suppose that there is no non-trivial solutions to 
rx, + 1x2 + 13x3 = 0 with x; € & (1 < i < 3). Recall thatk = |2| = |{(z) < 
(ow) | |a(z)| = G31. By Lemma 30, 


ke? < SY a@p? «1. 
(x)<(@) 
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Since k < ¢~*/?, there exists a positive constant Cs = C6(q) such that, upon setting 
5 = Co(log, log, N)~?/°, we have k < log, log, N. By Lemmas 32 and 33, 


nN 'g Olt < NT p> (11 Z)@) (12z) a1 (132) 
(z)<(w) 
< NN? + N11? 
C2472 4. tl -1/5 
« NN’ +N (log, log, N) 


« N7\(log, log, N)~'°. 
Thus 44g~@/" < (log, log, N)~'/°, which implies that 


1 1 
log, log, log, N « —log,n + A < 7, 


From the above inequality, we can deduce that 7 < (log, log, log, N)~!. Therefore, 


we have 


A 1 1 1 
An « « ; 
| Pr| log, log, log,N log, log, log,R _ log, log, log, log, | Pr 


Theorem 1 now follows. 
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The Distribution of Self-Fibonacci Divisors 


Florian Luca and Emanuele Tron 


Abstract Consider the positive integers n such that n divides the n-th Fibonacci 
number, and their counting function A. We prove that 


A(x) < x! C/2+0(1)) log log log x/ loglogx 


1 Introduction 


The Fibonacci numbers notoriously possess many arithmetical properties in relation 
to their indices. In this context, Fibonacci numbers divisible by their index constitute 
a natural subject of study, yet there are relatively few substantial results concerning 
them in the literature. 

Let Y = {ay},en be the increasing sequence of natural numbers such that a, 
divides F,,,: this is OEIS A023172, and it starts 


1, 5, 12, 24, 25, 36, 48, 60, 72, 96, 108, 120, 125, 144, 168, 180, ... 


(as they have no common name, we dub them self- Fibonacci divisors). Let moreover 
A(x) := #{n < x: n € &} be its counting function. 

This kind of sequence has already been considered by several authors; we limit 
ourselves to mentioning the current state-of-the-art result, due to Alba Gonzalez— 
Luca—Pomerance-Shparlinski. 


Proposition 1 ({1], Theorems 1.2 and 1.3). Jt holds that 


1 
(J + o(t)) logx < log A(x) < logx — (1 + 0(1)) Vlog x log log x. 
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We improve the upper bound above as follows. 


Theorem 1. We have that 


log x log log log x 


log A(x) < logx — (5 + o(t)) .. (1) 
2 loglogx 


The main element of the proof is a new classification of self-Fibonacci divisors. 
We now recall some basic facts about Fibonacci numbers. All statements in the 
next lemma are well-known and readily provable. 


Lemma 1. Define z(n) to be the least positive integer such that n divides Fyn) (the 
Fibonacci entry point, or order of appearance, of n). Then the following properties 
hold. 


¢ z(n) exists for alln € N. In fact, z(n) < 2n. 

° Foreverya,b €N, gcd(Fa, Fp) = Frecd(a,b)- 

* For every prime p, z(p) divides p — (2), (8) being the Legendre symbol. 

¢ Ifa divides b, then z(a) divides z(b). 

¢ For every a,b € N one has z(\cm(a,b)) = Icm(z(a), z(b)). In particular, 
Icm(z(a), z(b)) divides z(ab). 

* For every prime p and every n € N, z(p") = p™"—e”).) 2(p), where e(p) := 
Up(Fp)) = 1 and vp is the usual p-adic valuation. 


From now on, we shall use the above properties without citing them. 
Next comes a useful result concerning the p-adic valuation of Fibonacci numbers. 


Lemma 2 ([4], Theorem 1). The following equalities hold. 


0, ifn = 1,2 (mod 3); 
iy 1, ifn = 3 (mod 6); 
pace ifn = 6 (mod 12); 


v2(n) +2, ifn =O (mod 12). 


Us (F,,) = U5 (n) a 


For p # 2,5 prime, 


vpn) + e(p), ifn = 0 (mod z(p)); 


Up (Fn) = 0. ifn £0 (mod z(p)). 


To end the section, we point out an interesting feature of the upper bound in 
Theorem |: it should be, up to a constant factor of the secondary term, best possible. 
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A squarefree integer n is a self-Fibonacci divisor if and only if z(p) divides 
n for every prime p that divides n. This is certainly true if p — (2) divides n 
for every prime factor p of n. This is indeed strongly reminiscent of Korselt’s 
criterion for Carmichael numbers: one should therefore expect heuristics for self- 
Fibonacci divisors similar to those for Carmichael numbers to be valid; in particular 
Pomerance’s [7], which would predict 


log xloglog} 
be AG Sees ody 
loglogx 


The reader should compare this with Theorem 4 of [3], which states that, if (x) is 
the number of composite integers n up to x for which n divides F-(# )> then 


1 ] log log] 
log L(x) < logx— { = + o(1) ee 
2 log log x 


2 Arithmetical Characterisation 


In this section, we show how & can be partitioned into subsequences that admit a 
simple description. 
Note that 1 divides F,, if and only if z(m) divides n and set 


H.:= NnEeN:n/z(n) = k}. 


Our next task is to prove the following characterisation of the <%’s. Let c(k) := 
min .&% whenever <% is not empty. 


Theorem 2. % is empty if k is divisible by 8, 5 or p®”*! for an odd prime p. 
t 


Otherwise, if k = 2%+! I] p;' is its factorisation, 


i=1 


piF2 


t 
© Hh = sc(k)-5'. I] pet as (B1,...,Bi41) ranges over N't! with the 


i=1 
DiF25 
conditions that either Bi+; > 0 ifa; = e(pi), or Bi41 = 0 if a; < e(p;), for 


every i = 1, ifk is odd or 2 times an odd number; 


t 
© th = dc(k)- 591. 2Prt2. I] pet with (B,,..., Brt2) ranging over N'*? as 
i=1 
DiF2,5 


before, if k is a multiple of 4. 
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Proof. We shall henceforth implicitly assume that the primes we deal with are 
distinct from 2 and 5, and all the proofs when some prime is 2 or 5 are easily adapted 
using the part of Lemma 2 regarding the 2-adic and 5-adic valuations. 

Suppose that, for some n, n/z(n) = k, and p@ is the exact power of p that 
divides k. Upon writing k = pk’ and n = p“n’, with k’ coprime to p, this becomes 
n' /k’ = 2(p4n’). In particular, 


d+ v9(n') < Up(Fyptny) = Up(Fu'j) < vp(n') + e(p), 


which is absurd if d > e(p) + 1, so that .% = @ if p°”)t! divides k. 

If on the other hand & fulfills the conditions for <% to be nonempty, we want 
to know for which m € N, givenn € &, mn is itself in @: this will give the 
conclusion, once we know that all the numbers in the sequence are multiples of 
a smallest number c(k) which belongs itself to <%. The proof of this latter fact is 
deferred to Theorem 3 since it fits better within that setting. 

Suppose we have n € &, and take m = Pp +++ p% with a; > 0 for each i; set 
n= pe -++p>»n! with n’ coprime to m and A; > 0 for each i. 

If A; > 0 for all i, then one has 


n _ pe oe “pin! 


k= = 
zn) (pi! ..-payn) 


A Aw 
- Pi Pe n’ 
= ax(A}— 0 ax(Aw—e(py).0 . 
lem(p"™ 1—e(P1) 2(p1), = on e(Pw) (Dw). z(n’)) 


The pj-adic valuation of this expression is v,,(k), so in the denominator either 
max(A; — e(p;),0) is the greatest power of p;, or some of z(p1),...,Z(Pw), z(n’) 
has p-adic valuation A; — vp,(k) = A; — e(p;). Furthermore, one has A; > vp,(k), as 
n has to be a multiple of k. 

Now, the number 


mn peter... phutawy/! 


z(mn) (pit .-- phwt aw pt) 


A 
_ P ita, ae - phwt av p! 


~ ax(A —e 5 ax(Aw+ay—e(pw), 
eng ae OO ae) 


is equal to k if and only if its p;-adic valuation is v,,(k) for each i, that is 
Up (k) = Aj + ai — max(A; + aj — e(pi), Ai — Up, (K)). 


Suppose that the first term in the max is the greater, that is aj > e(pi) — Up,(k). 
The above equality reduces to v,,(k) = e(pj): so in this case each value > e(p;) — 
Up,(k) for a; (with any value of A;) is admissible, if v,,(k) = e(pi), and no value is 


admissible if 0 < up,(k) < e(pi). 
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Suppose that the second term is the greater, that is a; < e(p;) — Up,(k). The 
equality reduces to a; = 0, which is impossible. 

Finally, if A; = O for some i < w, it is no longer true that z(p?') = 
pon), but one has v,,(z(n)) = A; — vp,(k) = 0 = vp,(z(n')), so the 
above proof carries on with minor changes. 

Starting from c(k) and building all the members of <% by progressively adding 
prime factors, we find exactly the statement of the theorem. Oo 


In the remainder of this section, we show that c(k) admits a more explicit 
description. 


Theorem 3. Whenever c(k) exists, we have that c(k) = k|lcm {2 (t)}* 2 being 
the i-th iterate of z. 


Proof. To prove first that such an expression is well-defined, we show that the 
sequence of iterates of z eventually hits a fixed point. 

First note that, for k = []; pi", z(k) = lem {ppmere) : this is a divisor 
of a Icm {z(p;)};, where rad(k) = [|], p; is the radical of k. Consider now the 
largest prime factor P of k: if P > 7, its exponent in the previous expression 
decreases by at least 1 at each step, since the largest prime factor of z(P) is strictly 
smaller than P. Consequently, after at most vp(k) steps, the exponent of P would 
have vanished. By iterating the argument concerning the largest prime factor at each 
step, after a finite number £ of steps, 2‘ (k) will have only prime factors smaller 
than 7; set 2 (k) = 243°5°, 

Recall now Theorem 1.1 of [6]: the fixed points of z are exactly the numbers of 
the form 5/ and 12-5/. By noting that z(2%) = 3-27, 2(3”) = 4-391, 2(5°) = 5°, 
we get that z(273?5°) = 2max(a—2,2)3max(b-1.1)5¢ Since we can continue this until 
a < 2 and b < 1, we are left with a few cases to check to show that the sequence of 
iterates indeed reaches a fixed point. 

As c(k) must be a multiple of k, call T := c(k)/k. Consider next the obvious 
equalities 


T = 2(kT), 
2(T) = 2 (kT), 


22)(T) = 29 (KT), 
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Write x pai y for the statement “‘y divides x”. Then one has 
T = 2(kT) 
= z(lem(k, T)) 
= lem(z(k), <(T)) 
= lem(z(k), 2 (kT) 


= Jem(z(k), e(lem(z(k), <()))) 
= Iem(z(k), lem(z™ (k), 2° (T))) 
= lem(z(k), 2 (k), 2 (KT)) 


= Iem(z(k), 2 (k), 2 (k), ...). 


Note that we have not used yet that kT is the smallest member of ; this 
means the above reasoning works for any member of .%, so that any number 
in & is a multiple of klem(z(k), 2 (k), 2 (k),...). If we manage to prove that 
klem(z(k), 2 (k), z(k),...) is in .%, we will obtain the divisibility argument we 
needed in the proof of Theorem 2. 

Thus, we want to prove that T = lcm {2 (kK) works; it is enough to prove 
that the divisibilities we previously derived are equalities, or in other words that 
z(kT) = z(Icm(k, T)) for T defined this way. 

If k = |], p;" with 1 < a; < e(p;) for each i, then 


(Ce) (0) 


= Icm (Icm {z (p")}, , Icm ae (ri"')}; aes :) 


= lem ({z (p*")},, £2? (p*)},, --) 


on-((T1e)=((#)-# (De) -)) 


We would like to bring the []; p?" into the least common multiple, but some power 
of p; could divide the iterated entry point of some other prime to a higher power. 
Define then m(p;,) to be the largest exponent of a power of p;, that divides z (p;) as 
i andj vary; thus 


and 
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(rir) jen (: (11 2) (11 ; =) 
_. (im ( fppiorter (1 r) , 2) (1 r) -)) 


= tem ({2 (07) 2 FD}, PWD} +) 


We need this to be equal to 


lem ({z (p7')},. {2 (P?)},. {2 (PF},. -) 
= lem(z(k), 2(T)) = z(em(k, T)). 


All that is left to do now is to remark that this is true if and only if their p;-adic 
valuations are equal for each i, or in other words, as p; is coprime to z(p;') = z(pi), 


max(m/(p;) + a; — e(pi)), m(pi)) = m(pi), 


and this is evident. oO 


3 The Proof of Theorem 1 


Let x > 10. One of our ingredients is the following result from [3]. 


Lemma 3 ([3], Theorem 3). As x — oo, 


#{n <x: z(n) = my < x C/2+0(1)) log log log x/ log log x 


uniformly in m. 


Let n € &(x). By Theorem 2, every self-Fibonacci divisor is of the form c(k)m, 
where m is composed of primes that divide k. Thus, write n = c(k)m, where every 
prime factor of m divides k. Let C(x) := x'08!!08+/ les los, We distinguish two cases. 


Case 1. k < x/C(x). 


Let <4 (x) be the subset of such n € &(x). We fix k and count possible m’s 
because c(k) is determined by k; we use an idea similar to the one of the proof of 
Theorem 4 in [2]. Clearly, m has at most w(k) distinct prime factors. Define next 
W(x, y) to be the number of positive integers £ < x whose largest prime factor 
P(£) satisfies the inequality P(£) < y, and let p, be the s-th prime. If Y% is the 
set of the prime divisors of k, the quantity of numbers m < x all of whose prime 
factors are in Y; is of course at most V(x, Paw) < W(x, 2 log x) for x large enough. 
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Here we used the fact that p, < s(log s + log logs) for all s > 6 (Theorem 3 of [8]) 
together with w(k) < 2logk/loglogk for all k > 3. Classical estimates on W(x, y), 
such as the one of de Bruijn (see, for example, Theorem 2 on page 359 in [9]), show 
that if we put 


then the estimate 


1 1 
log W(x, y) = Z (: +O (— y log aa)) ~ 


holds uniformly in x > y > 2. The above estimates (2) with y = 2logx imply 


that there are at most C(x)3!0g3—2log2+0(1))/logloglogx — C(x)°) values of m for any 


fixed k. Summing up over k, we get that 


tA <c@? YI 1 


k<x/C(x) 


x 
< Clay Fo (x large). (3) 


Case 2.x/C(x) <k <x. 


Here, we have kz(k) < c(k) < x, whence z(k) < C(x). Fix z(k) = zin [1, C(@)]. 
By Lemma 3, if we put ZB, := {n € N: z(n) = 2}, then the inequality 


#BAt) < t/C()'/?*°) ~~ holdsas_- tt > ov. (4) 
We now let k € &,. Then n < x is a multiple of kz. The number of such n is 


|x/kz| < x/kz. Summing up the above inequality over k € &, and using partial 
summation and (4), we have 


x y 1 x is d#Z-(t) 
Zz = t 


kEB- % Jx/C(x) 
x/C(x) <k<x 
#B(t) |t=* “  #BAt 
-2(20~ +f 0a) 
Z t t=x/C(x) x/C(x) t 


IA 


x (#B(x) +f dt 
7 x new tC() 2+ 


_ x ( 1 i 1 i *) 
z C(x) 240M " C1240) aot) t 


— A+ o(1))xlog Cx) _ x 
_ zC(x)!/2+0) = zC(x)!/2+0()* 
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where in the above calculation we used the fact that 
c(t) = C(@/?+eO uniformly in t € [x/C(x),x] as x > 00. 


We now sum over z € [1, C(x)], and obtain that 


x 1 
HAG) S CGiarom oe 2 


1<z<C(x) @ 
_ (+0())xlog Cx) _ ” 
= C(x)!/2+0) ~ CG 2+00) (x > 00). (5) 


The desired conclusion now follows from (3) and (5). 


4 Comments 


Of course, the methods we presented apply equally well to other Lucas sequences, 
where analogues of Theorems 2, 3 and | hold; we chose to display the Fibonacci 
case, when the classification takes a particularly simple form. 

To conclude, we make some observations to promote future progress. The 
problem of finding lower bounds for A(x) requires completely different ideas; one 
can prove that 

(a log “2*) 
log A(x) = log#{n < x: c(n) < x, n squarefree} + O | ——————— 
log logx 


so that in order to prove A(x) = x! + O(loslostos+/loglosx) ynconditionally one would 
need to build many squarefree n with small c(n). The best we managed to prove 
is that logc(n) < 3P(n) (by double counting), and logc(n) < 7 Y in(log p) (by 
induction), but neither of these is sufficient. This hints at building numbers n for 
which their prime factors share most of their Pratt-Fibonacci trees (Pratt trees built 
with the factors of z(p) as children of a node p®, taken with their exponents). 

The set of numbers n with small c(n) is both small and large in a certain sense: it 
has asymptotic density 0 and exponential density 1, conjecturally. 

It is likely that c(n) is quite large for most n. Recall that putting 


F(n) = rad | | [ 6%) 
k>1 
then in [5] it is proved that the inequality 


F(n) > ni +e(1) log logn/ log log logn 
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holds for n tending to infinity through a set of asymptotic density 1. Since c(n) is 
quite similar to F(n), we conjecture that a like result holds for c() as well. 
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Some Remarks on Automorphy 
and the Sato-Tate Conjecture 


M. Ram Murty and V. Kumar Murty 


Abstract We present an informal account of the evolution of the Sato-Tate 
conjecture and describe some recent work of the authors that it gave rise to. 


1 The Conjecture 


Let E be an elliptic curve defined over the rationals of conductor N. For any prime 
p not dividing N, we may consider the number of points N, on the reduction of E 
mod p. Following Hasse, we have the inequality 


INp—- (P+ D| = 2p. 
Thus, we may write the integer 
dy = Ny — (p + 1) 
as 
dp = 2,/pcos(@), O< O < az. 


The Sato-Tate conjecture describes the distribution of the “angles of Frobenius” 
6, as p varies. if E has complex multiplication, one expects that the angles are 
essentially equidistributed (after one takes into account the fact that for half the 
primes, namely those that do not split in the field of multiplication, 6, = 2/2). If 
E does not have complex multiplication, then the Sato-Tate conjecture predicts a 
skewed distribution, namely 
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4 


B 
#{p <x: 6 €[a, pl} ~ (2 / si 0) (x) 


where z(x) as usual denotes the total number of primes p < x. The integral can of 
course be evaluated, and so the right hand side may also be written 


4 


(! =f 2 ang sin2a) ) (x). 
21 


The Sato-Tate conjecture is actually a theorem now due to the work of Barnet-Lamb 
et al. [1]. 


2 Origin of the Conjecture 


Where does such a conjecture come from? Mikio Sato was led to it by numerical 
calculations. This is described in the beautiful article [11]. It also occurs in Tate’s 
1964 talk [14] at the Woods Hole Summer Institute on Algebraic Geometry 
organized by the American Mathematical Society. In that talk, he formulated the 
following conjecture about algebraic cycles on algebraic varieties. Let X be a smooth 
projective variety over a number field K. Let L D> K be a finite extension. Consider 
the Abelian group A‘(X;L) generated by codimension i subvarieties (modulo 
homological equivalence) which are homologically equivalent to one defined over L. 
The ¢-adic cycle class gives a map 


Ai(X;L) ® Q > H7(X)(i. 


Here, X is the base-change of X to an algebraic closure K of K, and Hy (X) is the 
t-adic cohomology of X. This is a finite dimensional Qy-vector space on which there 
is a continuous action of Gal(K/K). In the case X is an Abelian variety, we have 


Hi (X) = AH? (X). 
Moreover, H, H (X) is the Qy-dual of the Tate module V(X) defined by 
Ve(X) = lim X[€"] @z, Qe. 


This is a Gal(K/K) module in the evident way as Gal(K/K) acts on X[£"]. 
The cyclotomic character 


Xe: Gal(K/K) = Aut(pgoo) 


gives the action of Galois on €-power roots of unity. The Tate twist Hi (X)(i) is the 
Galois module Hi(X) ® x}. 
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The cycle class map is Galois equivariant, and so the image lies in the subspace 
of H7 (X)(i) fixed by Gal(K/Z). 


Conjecture I (Tate Conjecture 1). The map 
AN(X;L) @ Qe > HER) HME 


is surjective. 


This conjecture is still open in general, though there is now a vast literature on 
establishing it in special cases. One can get a (by now partial) picture of what is 
known from Tate’s article [15] in the Motives volume. 

In [14], Tate computed that for X = E”, with EF an elliptic curve defined over 
K = Q, we have 


(2) if E has CM 


( — (,",) G21) _ if E does not have CM. 


dim A'(X,L) = (1) 


Here L = Qin the non CM case, and Lis the field of multiplication in the CM case. 
To do this calculation, he did first the case i = 1 and then proved that A’ is generated 
by A!. Soon afterwards, Mumford introduced the Mumford-Tate group and using the 
invariant theory of this group, the calculation becomes somewhat simpler. 

Let us consider Tate’s formula (1) in some special cases. For example, form = 2 
andi = 1, we have X = E x E and the codimension | cycles are (up to algebraic 
equivalence) E x {0}, {0} x E, and the diagonal A. In the CM case one has, in 
addition, the graph Acy of the complex multiplication. Thus, the dimension of 
A!(X;L) is either 4 or 3 depending on whether E does or does not have CM. Here, 
we can take L = Qif E does not have CM. If E does have CM, the field L should 
contain the CM field. 

Similarly, form = 3, andi = 1, and E without CM, we have X = EX EXE. 
Denoting by EF), E, and E3 the three copies of EF, we have the 6 generic cycles 
Ex E x {0}, E x {0} x E, {0} x Ex E, Aj x E, E x Ax3 and Aj3 x E where Agy is 
the diagonal in E, x E,. Note that 


6 (7) 5) ¢ 
1 (; ,) : 
3 L-Functions 


For a smooth projective variety over K, consider the Euler product 


&(X,s) = | | det( — Frob, |Hi(X)" (Nv) ) 1. 
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Here the product is over finite primes v of K. This converges for i(s) > 1 + 5 
Thus, when X = E and i = 1, we have the Hasse L-function of E: 


; =I ; =i 
®(E,s) = T]{1- ae) 
= OM ay (Noyt) 


For L a finite extension of K, we may also consider the base change X7, (in other 
words, X viewed as a variety over L) and in this case, we have the corresponding 
Euler product ®;(X/z, 5). 


Conjecture 2 (Tate Conjecture 2). ®;(X/z, 8) has a pole at s = 1 + i of order equal 
to dim A‘(X, L). 
In particular, (1) and the above conjecture predict that 


ea if E has CM 


‘a — (2,) G21) if E does not have CM. 


—ord,= 441 P24 (E77, 5) = 


Here, L = Qif E does not have CM and the field of multiplication in the CM case. 
Tate also suggests that ®,.41(X,s) does not have a pole or zero at the edge of its 
critical strip, namely 


1 3 
=1+4+=(2k+1) = k+25. 
s=1+ 52k +1) +5 


These analytic conjectures are motivated by analogy with Artin L-functions and by 
the Birch and Swinnerton-Dyer conjecture. Again, for X = E’”’, he computes 


_ (6—5OCD 
P(X, 5) J ee 5) . 
Here, 
Mo(s) = f(s) 
and for k > 0, 


eikOv —l ea ikOy —l 
Ms) = | — — 1 — — : 
ei (ao) O- ey) 
If Eyq has CM, the M; are Hecke L-functions and we know they have analytic 
continuation to i(s) = 1 and are non-vanishing on that line. Hence, with L the CM 


field, the order of pole at s = k+ 1 of Dx (E’,, Ss) is the contribution from i = k and 
this is 


Gy = dimA‘(E",L). 
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Now consider the case Eg does not have CM. Let cz denote the order of My 
at s = 1 (assuming that M; is meromorphic at s = 1). Then, we have cp = 1. 
Moreover, as 


Dy(X, S) = M)(s as NOC Mo(s — yO) 
we expect that cp = —1. Also, we expect cx, = 0 for all k > 1. Indeed, we have 
D(X, S) = I] Mo-o;(s —)CDGA), 
0<i<k 


The factors on the right corresponding to i = k andi = k — 1 account for the 
expected pole of the left hand side. 

Suppose also that co,41; = 0 for all k > 0. Then by a Tauberian argument, we 
might expect 


(ci + eH) + cy, 


psx 


1 
7 (x) 
Consider 
1 
F(x) = — kx. 
(x) = Me cK COS kx 


This should then be the distribution function for the 6,. In other words, 


1 2 
F(x) = — (1—cos2x) = —sin’ x. 
I ua 


4 Modular Forms 


At the time, arguing by analogy, Serre formulated a conjecture to describe the 
distribution of the “angles of Frobenius” for any holomorphic cusp form of weight 
k => 2, level N and trivial character that is a normalized eigenform for the Hecke 
operators. For such a form f, let us write 


f(2) = So ap(nyern" 
n>1 


for the Fourier expansion at the cusp ioo. Let us suppose that it is not of CM-type 
(in the sense of Ribet). That is, there does not exist a Dirichlet character y with 
the property that a¢(p) = x(p)a,(p) for almost all p. Then, the Sato-Tate conjecture 
formulated by Serre in this context is as follows. As before, we can write 


ay (p) = 2p*Y?? cos 6, 
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for some 6, € [0, x]. Then 


B 
H{p <x: 6, € [a, B]} ~ (2 / sin? oa) (x). 


At the time that Serre proposed this, the Taniyama conjecture was not yet proven and 
so it could not be considered a generalization of the original Sato-Tate conjecture 
for elliptic curves. 


5 The Symmetric Power L-Functions 


We now know that the M; do not have good analytic properties and the “correct” 
L-functions to consider are 


E eibu(k—2j) \ 
L = 1 — —— : 
£6) IIT ar) 


The product converges absolutely for ¢(s) > 1 so defines an analytic function in 
that half plane. There is a simple relation between the L; and the M;. We have 


Lo(s) = Mo(s) = &(s) 
Li(s) = Mi(s) = LE, s+ 5) 
pity \ I -1 e72i0» \ —! 
4) =] (: - can, (: 7 TaD, (: - a) 
which can be rewritten as 
Lp(s) => Mo(s)M2(s). 


We note that if the middle factor in the above Euler product for Lz were removed, we 
would get good analytic properties for the resulting function only in the CM case. 
In general, we have the relations 


Lx(s) 


Mo(s)M2(s)Ma(s) «+» Mox(s) 


and 
Loe+i(s) = Mi(s)M3(s)Ms(s) ++ +Mox+1(s) 


Using the L,,, Serre proposed a conjectural approach to proving this. If it can 
be shown that all the L,,(s) are analytic for Sis > 1 and non vanishing on the line 
H(s) = 1, then Serre showed [12] that Tauberian theorems could be used to deduce 
the above conjecture. Soon afterwards, Ogg [9] showed that if one had the analytic 
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continuation of the L,, for St(s) > 5 — € for some € > 0, then the non-vanishing 


on the line {i(s) = 1 could be deduced. In the work of the second author [5], it was 
shown that non-vanishing was in fact a consequence of the analytic continuation 
just to R(s) > 1. 

In [6], it is shown that if we had the analytic continuation of all the L,, at s = 1 
(that is, just the point and not the whole line), we could deduce the following weaker 
version of the Sato-Tate conjecture, namely 


1 2 f° 
> Bees =| sin? 6 dé | logx. 
pa COUP Sa 


Op €[a.B] 


The question of how to prove the analytic continuation remained. A conjectural 
approach to proving it was provided by Langlands. 


6 Langlands’ Conjecture 


Now let us restrict to the case E7g does not have CM and consider the family of 
L-functions L,,(s) introduced above. 


Conjecture 3 (Langlands, 1970). There exists 1, € A(GL,»+1) a cuspidal auto- 
morphic representation such that 


L(s, 2m) = Lin(s) 


for all m > 1. 


Given the known analytic properties of the standard L-function associated to a cus- 
pidal automorphic representation of GL,,, a consequence of Langlands’ conjecture 
would be the analytic continuation of the L,,,(s) for all s. 

It is interesting to note that the Sato-Tate conjecture (both the original version 
as well as Serre’s generalization) was proved by Barnet-Lamb et al. [1]. However, 
this was achieved without proving Langlands’ conjecture. What the authors of [1] 
did prove is the potential automorphy of the symmetric power L-functions. This 
means that when the corresponding Galois representation is restricted to a subgroup 
of finite index, it’s L-function is the Z-function of an automorphic representation. 
This, therefore, still leaves open the automorphy of the L,,(s) itself. 

Conjecture 3 is a special case of the Langlands Functoriality conjecture. Indeed, 
beginning with an automorphic representation z of GL, and considering the m-th 
symmetric power of the standard representation 


Tm: GL2(C) > GLin4i(C) 
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the Langlands L-function L(s, 7, rm) is Ly(s). Viewing rm as a map between the 
complex L-groups of GL, and GL,,+1, functoriality would predict the existence of a 
Tim as above. 

For m = 2, this is the work of Shimura and Gelbart-Jacquet. For m = 3 this 
is due to Kim and Shahidi [4] and for m = 4, to Kim [3]. The main result of our 
work is to show that the holomorphy for all m follows from the automorphy of a 
restricted class of Rankin-Selberg products, as we explain below. (After this work 
was completed, we learnt! of recent work of Clozel and Thorne [2] that outlines 
a similar strategy. However, there seem to be differences as the approach of these 
authors involves local conditions and deformations of Galois representations. Using 
their approach, they have now proved the automorphy of L,, for m < 8.) 


7 Rankin-Selberg Convolution 


Another instance of functoriality is the Rankin-Selberg convolution. Denote by 
Ao(n, F) the set of cuspidal automorphic representations of GL, over F and let 
mw; € Ao(n;,F) fori = 1,2 and assume both are unitary. The Rankin-Selberg 
convolution provides an L-function L(s, 2; X 22) which by the work of Jacquet and 
Shalika extends to a function analytic everywhere except possibly at s = 1 where it 
has a simple pole if and only if 7; ~ m5 (the dual of 22). Another special case of 
functoriality would be the following hypothesis. 


Conjecture 4. There exists a map 
Ar(m,n2) : Ao(m,F) x Ag(m2,F) > Alnn, F) 
in which a pair (zt), 72) is mapped to 73 in such a way that 
L(s, 73) = L(s,m X 7). 
It is known that Hp(n, 1) exists for all n. This is due to Godement-Jacquet. The work 


of Ramakrishnan [10] shows that Hr(2, 2) exists and the work of Kim and Shahidi 
[4] shows that H;-(2, 3) exists. Our main result is the following. 


Theorem 1. Suppose Hg(n, 2) exists for all n. Then all the L, are automorphic. 


'Thanks to Florian Herzig for informing us of this. 
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8 Brief Outline of the Proof 


We will outline the main steps in the proof of Theorem |. The details will be 
presented elsewhere. We note that contrary to the expectation that the proof of the 
analytic continuation of the L,,, could be used to prove the Sato-Tate conjecture, our 
argument uses the Sato-Tate theorem to deduce the analytic continuation. 

There are three main steps of the proof. The first is to define the notion of virtual 
automorphy. We say that a Dirichlet series 


F(s) = ar 


is virtually automorphic if 
F(s) = | [L6, 2)" 


where each z; is an element of A(n;, Q) for some positive integer nj; and some b; € Z. 
We say it is virtually cuspidal if all the 7; in the above factorization are in Ag(n;, Q). 
Now, we use the hypothesis H(n, 2) to show that the L,, are virtually cuspidal. 

In the second step, we have a natural notion of Rankin-Selberg convolution F x G 
of two virtually cuspidal Dirichlet series F and G. Using this, and the self-duality of 
the L,,, we deduce that 


* \ bib 
Leg XI =| | Lag eaghy)™. 
jk 


The right hand side has a pole at s = 1 of order = > by using a result of Shahidi 
on Rankin-Selberg convolutions. 

The third step is to show that the left hand side in fact has a simple pole at 
s = | as aconsequence of the Sato-Tate theorem and an application of a Tauberian 
theorem. This implies that only one of the b; is non-zero and must in fact be +1. It 
can be deduced (using the location of trivial zeros) that it is in fact equal to | and 
thus L,, is automorphic. The details are presented in [7]. 


9 Variants 


Let S(N,k) denote the space of holomorphic cusp forms of weight k for I(N) 
and denote by 7, = T,(N,k) the n-th Hecke operator acting on S(N,k). Then 
the eigenvalues of the normalized operator T,/p“~/? lie in the interval [—2, 2]. 
The Sato-Tate conjecture (in the general form given to it by Serre) concerns the 
distribution of these eigenvalues as N and k are fixed and p varies. In [13], Serre 
began a study of the distribution properties when p is fixed and N and k vary. 
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In particular, he proved that for a sequence (Ny, k,) with Ny, + k, — oo, and for p 
a prime that does not divide any of the Nj, the eigenvalues of T, (Na, k,)/ pe-D/ a 
are distributed in [—2, 2] according to the measure 


p+1 (—2x/4)3 
H (pi+p 2? —# 


D = 


This was made effective in the work of the first author and Sinha [8]. Denote by 
s(N, k) the dimension of S(N, k). Let a); for 1 < i < s(N,k) denote the eigenvalues 
of T, acting on S(N, k) (counted with multiplicity). Then, in ([8], Theorem 2), it is 
proved that 


logp 
log kN 


. Di , 
#1 <i 9,8): np € [a, B}} = [ Hp + OC ) 


s(N, k) 


where the implied constant is effectively computable. 
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Division Polynomials with Galois Group 
SU3(3).2 & G)(2) 


David P. Roberts 


Abstract We use a rigidity argument to prove the existence of two related 
degree 28 covers of the projective plane with Galois group SU3(3).2 = G»(2). 
Constructing corresponding two-parameter polynomials directly from the defining 
group-theoretic data seems beyond feasibility. Instead we provide two independent 
constructions of these polynomials, one from 3-division points on covers of the 
projective line studied by Deligne and Mostow, and one from 2-division points 
of genus three curves studied by Shioda. We explain how one of the covers also 
arises as a 2-division polynomial for a family of Gz motives in the classification of 
Dettweiler and Reiter. We conclude by specializing our two covers to get interesting 
three-point covers and number fields which would be hard to construct directly. 


1 Introduction 


Suppose Y is a variety over Q with bad reduction at a set S of primes. For any prime 
£, there are associated number fields coming from the mod £ cohomology of the 
topological space Y(C). On the one hand, these number fields are interesting because 
their Galois groups tend to be Lie-type groups and their bad reduction is constrained 
to be within SU {£}. On the other hand, defining polynomials for these number fields 
are often beyond computational reach, even for quite simple Y and very small €. In 
this paper, we work out some remarkable examples in this framework, with our 
computations of defining polynomials being ad hoc and just within the limits of 
feasibility. 


1.1 Section-by-Section Overview 


Section 2 provides background on the theoretical context, presenting it as a 
generalization of the familiar passage from an elliptic curve to one of its division 
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polynomials. It then gives information on the Lie-type group which plays the central 
role for us, namely $U3(3).2 © Gp(2). Finally, the section reviews an earlier 
construction of a one-parameter polynomial for this Galois group due to Malle and 
Matzat [12, p. 412]. 

Section 3 explains how a rigidity argument gives two canonical degree 28 covers 
of surfaces defined over Q, each with Galois group SU3(3).2  Gp(2). In our 
notation, these covers are 


my: X, > U344, Ty : Xp > U32. 


The bases are respectively U3, = Mo,5/S3 and U3. = Mo 5/(S3 x S2), these 
being moduli spaces of five partially distinguishable points in the projective line. 
We explain how the covers are related by a cubic correspondence deduced from an 
exceptional isomorphism U3. & U2,12 first studied by Deligne and Mostow [5, 
Sect. 10]. Standard methods, as illustrated in [14], might let one construct the covers 
x; directly if certain curves had genus zero. However these methods are obstructed 
by the fact that these curves have positive genus. 

Sections 4, 5, and 6 concern varietal sources for our covers. Section 4 starts with 
two different two-parameter families of covers of the projective line considered 
by Deligne and Mostow [4]. Via the group SU3(3).2, these families of curves 
yield zr; and x2 from 3-division points. We use the second family to compute a 
defining polynomial F(a, b,x) for m2, and then transfer this knowledge to also 
obtain a polynomial F)(p,q,x) for 2. Section 5 starts with a large family of 
genus three curves studied by Shioda [16]. This family already has an explicit 
2-division polynomial S(r1,13,174,1s,16,17,19,Zz) With Galois group Sp.6(2). We 
find appropriate loci in the parameter space where the Galois group drops to the 
subgroup G2(2), and thereby independently get alternative polynomials S$} (p, g, z) 
and S>(a,b,z) for the two covers. Section 6 explains how F)(p,q,z) also arises 
as the 2-division polynomial of a family of motives with motivic Galois group 
G» studied by Dettweiler and Reiter [6]. Sections 4, 5, and 6 each close with a 
subsection explicitly relating L-polynomials modulo the relevant prime £ to our 
division polynomials. 

Section 7 shifts the focus away from varietal sources and onto specializations 
of our explicit polynomials. Specializing to suitable lines, we get 14 new degree 
28 three-point covers with Galois group SU3(3).2 & G»(2). These covers all have 
positive genus, and it would be difficult to construct them directly by the standard 
techniques of three-point covers. 

Section 8 specializes to points, finding 376 different degree 28 number fields with 
Galois group SU3(3).2  G2(2) and discriminant of the form 2/3". Again it would 
be difficult to construct these fields by techniques within algebraic number theory 
itself. We show that a thorough analysis of ramification in these fields is possible, 
despite the relatively large degree, by presenting such an analysis of the field with 
the smallest discriminant. 
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1.2. Computer Platforms 


The bulk of the calculations for this paper were carried out in Mathematica [19]. 
However most calculations with number fields were done in Pari [17] while most 
calculations with L-functions were done in Magma [2]. 

Many of the statements in this paper can only be confirmed with the assistance 
of a computer. To facilitate verification and further exploration on the reader’s part, 
a commented Mathematica file on the author’s homepage accompanies this paper. 
It contains some of the formulas and data presented here. 


1.3 Relation to a Similar Paper 


The polynomials F\(p, g,x) and F>(a, b, x) are similar in nature to the polynomials 
207(u, v,x) and gog(u, v,x) of [14] which have Galois groups W(E¢) and W(E7)T 
respectively. However [14] and this paper focus on different theoretical topics to 
avoid duplication. The discussion of monodromy and the universality of specializa- 
tion sets in [14] applies after modification to the new base schemes U3; and U3. 
here. Similarly, our general discussion of division polynomials here could equally 
well be illustrated by g27(u, v, x) and gog(u, v, x). 


2 Background 


This section provides some context for our later considerations. 


2.1 Division Polynomials 


Classical formulas [18, p. 200] let one pass directly from an elliptic curve Y : y? = 
x? + ax + b to division polynomials giving x-coordinates of their n-torsion points. 
Initializing via f, = 1 and f, = 2, these division polynomials f, for n > 3 are 
computable by recursion: 
fs = 3x4 + 6ax* + 12bx — a’, 
fs = 4x° + 20ax* + 80bx? — 20ax* — 16abx — 4a* — 32d’, 
tom = fin (aaa ey — fn—2fm-+1) }2, 
Sam+1 = x + ax + bY fom-+2fom — fom—ifim-t1 ’ 
fam43 = Poisoned ~ (x + ax + DY Dados 
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Special cases give particularly interesting number fields. For example, at 
(a,b) = (—1/3, 19/108) the degree sixty polynomial fi; € Q[x] has Galois group 
GL2(11)/{+1} and field discriminant —11!%. 

On an abstract level, there are interesting number fields from n-torsion points on 
any abelian variety over Q. More generally, from any variety Y over Q there are 
interesting field extensions from the natural action of Gal(Q/Q) on the cohomology 
groups H"(Y(C), Z/nZ). However for most pairs (Y, 7), there is nothing remotely 
as explicit as the above recursion relations. In fact, there is presently no way at all 
to produce explicit division polynomials describing these fields. 


2.2 The group SU3(3).2 = G2(2) 


The Atlas [3] provides a wealth of group-theoretic information about the group 
SU3(3).2 & Gp(2). In particular, this group has the form /.2, where I” has order 
6048 = 2°337 and is the 12th smallest non-abelian simple group. 

Table 1 presents some of the information that is most important to us. The left 
half corresponds to the 14 conjugacy classes in J”. The six classes 1A, 2A, 3A, 3B, 
AC, and 6A are rational, while the remaining classes are conjugate in pairs over the 
quadratic fields Q(i), Q( /—7), Q(i), and Q(i) respectively. When one considers the 
full group I°.2, these pairs collapse and one has 16 conjugacy classes, ten in J” and 
six in 7.2 — I’, with 12c and 12d two classes conjugate over Q( /—3). 

Of particular importance to us is that 1.2 embeds as a transitive subgroup of 
the alternating group Ag. The cycle partition A235 associated to a conjugacy class is 
given in Table |. The group I.2 also embeds as a transitive subgroup of A3¢ and the 
corresponding A36 are given. We use the degree 36 embedding only occasionally. 
For example, it is useful for distinguishing 3A from 3B via cycle partitions. As a 
convention, if we do not refer explicitly to degree we are working with the degree 
28 embedding. 

As just discussed, Table | has information about permutation representations 
of I.2. We are also interested in linear representations, and some group-theoretic 
information is contained in the small tables at the end of Sect. 4.3 (for characteristic 
3), at the end of Sect. 5.5 (for characteristic 2), and in Fig. 3 (for characteristic zero). 


2.3 Rigidity and Covers 


Some fundamental aspects of our general context are as follows. Let G be a finite 
centerless group and let C = (C},..., C,) bea list of conjugacy classes in G. Define 


»(C) = {(g1,...,82) €C1 X +++ X Cl: gi gz, = 1}, 
INC) = {Gyo s03he) € DO) ehh 08) = Gh 
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Table 1 Information about conjugacy classes in /.2 


Classes in I~ Classes in 1.2 — I 

Cc IC| A28 A136 Cc IC| Ars A36 

1A 1 8 17° 2b 252 21214 | 21614 
2A 63 y2'?14 | 2!24!2 4d 252 4614 | 4696 
3A 56 3°1 312 6b 2016 6431 =| 6°3!211 
3B 672 3°1 31143 8c 1512 834 834221? 
4AB |2-63 | 4°14 4626 12cd |2-1008 | 12731 | 1226? 


4C 378 4622 | 469414 
6A 504 6431 | 6434 
TAB |2-864 |7+ Pi 
8AB | 2-756 | 832!1? | 8343 
12AB |2-504 | 12731 | 1276? 


The group G acts on these sets by simultaneous conjugation and the action is free 
on X(C). The mass of C is Z(C) := | (Ci,...,C-,)|/|G|. A classical formula, 
presented in e.g. [12, Theorem 5.8], gives the mass as a sum over irreducible 
characters of G, 


4. 1 (Gla G CS 
C)= : 1 
BO) = ep aye ” 


We say that C is rigid if w(C) := |X (C)|/|G| = 1 and strictly rigid if moreover 
B(C) = 1. 

Let G C 5S, now be a transitive permutation realization of G such that 
the centralizer of G in S, is trivial. Let t|, ..., t, be distinct points in the 
complex projective line, connected by suitable paths to a fixed base point. A tuple 
(g1,---, 8) € S(C) then determines a degree n cover of the projective line with 
monodromy group G and local monodromy transformation g; about the point 7;. 
The genus g, of the degree n cover is calculated via the cycle partitions A; - n by 
the general formula 


JAi] +++ + Az] = @— 2)n + 2 — 2gn. (2) 


Here |A;| indicates the number of parts of ;. 
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Let Moy be the moduli space of w labeled distinct points in the projective line. 
This is a very explicit space, as T,, T2, and T3 can be uniquely normalized to 0, 1, 
and oo respectively. The group S,, acts on Mo,, by permuting the points. If vp = 
(11,...,,) sums to w then we let S, = S,, x--- x Sy, and put Uy = Mow/S). 

When C = (C),...,C-,) is rigid and the t; move in Mo,,, all the covers of the 
projective line fit together into a single cover of Mo,,+1. Moreover, under simple 
conditions as exemplified below, this cover is guaranteed to be defined over Q. When 
z = 3, the space Mp,3 is just a single point and Mo,4 identified with P! — {0, 1, oo}, 
with t4 serving as coordinate. This case has been extensively treated in the literature. 
When z > 4 the situation is more complicated and a primary purpose of [14] and 
the present paper is to give interesting examples. When some adjacent C; coincide, 
the cover descends to a cover of the corresponding quotient U,, of Mo,.+1. 


2.4 The Malle-Matzat Cover 


Malle and Matzat computed the cover coming from the strictly rigid genus zero 
triple (4d, 2b,12AB) belonging to the group I.2. This Malle-Matzat cover is 
similar, but much simpler, than the covers z, and x2 that we are about to consider. 
Accordingly we discuss it here as a model, and use it later as well for comparison. 
Identifying the degree 28 covering curve X with P!, the cover P| — P! is then 


given by the explicit degree 28 rational function 


— (x4 + 20x7 + 114x? + 68x + 13)- 
(x° — 6x5 — 435x — 308x3 + 15x? + 66x + 19)" 


P= 
2239 (x2 + 4x + 1)!" (2x +1) 


The partitions A; = 4°14 and A3 = 12731 are visible as root multiplicities of the 
numerator and denominator respectively. Rewriting the equation as 


m(t, x) = 0, (3) 


the partition A. = 2!714 likewise appears as the root multiplicities of m(1, x). 
The discriminant of the monic polynomial m(t, x) is 


Dn(t) = PR aes ak (ae 1, (4) 


It is a perfect square, in conformity with the fact that [2 lies in the alternating 
group A». Thus D,,,(t) is not useful in seeing how the .2 enters Galois-theoretically. 
In fact, the order two quotient group corresponds to the extension of Q(t) generated 
by /r(1 —2). 

The general theory of three-point covers says that X — P, has bad reduction 
within the primes dividing |J.2|, namely 2, 3, and 7. A particularly interesting 
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feature of D,,(t) is that it reveals that in fact the Malle-Matzat cover has good 
reduction at 7. In [13, Sect. 8], we explained how the Malle-Matzat polynomial is 
a 3-division polynomial for a family of varieties with bad reduction only in {2, 3}, 
and this connection explains the good reduction at 7. 


3 Rigid Covers of U3; and U3,2 


This section explains how general theory gives the existence of our two main covers 
mw 1X, — U31,1 and m2 : X2 — U3 and the cubic relation between them. 


3.1 Five Strictly Rigid Quadruples 


For a fixed number of ramifying points z and a fixed ambient group G, the mass 
formula (1) lets one find all C with 7(C) = 1. From any explicit tuple (g1,..., 2) € 
X(C), one gets 4(C) = 1 or 0 according to whether (g1,...,g-) is all of G or not. 
Carrying out this mechanical procedure for z = 3 and G = I..2 gives several strictly 
rigid triples, with only the Malle-Matzat triple having genus zero. For z = 3 and 
G = I’, one gets yet more rigid triples. None of these have genus zero and some of 
them appear in Table 2 below. 
Applying this mechanical procedure for z = 4 yields the following result: 


Proposition 3.1. There are no strictly rigid quadruples in [’.2. Up to reordering 
and conjugation by the outer involution of I’, there are five strictly rigid quadruples 
in DT: 


(3A, 3A, 3A, 4B): (genus 9), (4A, 4A, 4A, 2A): (genus 6), 

(4A, 4A, 4A, 4B): (genus 9), 

(2A, 2A, 3A, 4A): (genus 3), (4A, 4A, 3A, 3A): (genus 9). 
Moreover, there are no other rigid quadruples C with [(C) < 4. oO 


The list of all quadruples considered in the process of proving the proposition 
makes clear that the five quadruples presented stand quite apart from all the others. 
For the case G = [.2, the quadruples C with the smallest 2(C) are (4d, 2b, 2A, 2A), 
(4d, 2b, 3A, 2A), (4d, 3d, 4AB, 2A), (2b, 2b, 3A, 2A), and (4d, 4d, 3A, 2A). The cor- 
responding (u(C),/(C)) are (0,2.750), (3,3.000), (0,3.375), (0,3.500), and 
(3, 3.666). For the case of G = I, there are 15 other C with 7(C) € [1, 2); all have 
LU(C) = 0. Likewise, there are 12 C with 7(C) € [2,3); four have ~(C) = 0 and 
eight have w(C) = 2. Continuing the trend, there are eight C with Z(C) € [3, 4); 
two have 4(C) = 0 while six have ~(C) = 3. In particular, as asserted by the 
proposition, j4(C) = 1 does not otherwise occur in the range [1(C) < 4; we expect 
that 4(C) = 1 does not occur either for 7(C) > 4. 
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3.2. The Two Covers 


In this section, we explain how the left-listed quadruples in Proposition 3.1 all give 
rise to the same cover 7 : X; —> U3,,1 while the right-listed quadruples both give 
rise to the same cover m2 : X2 — U3. Figure | provides a visual overview of our 
explanation. 


3.2.1 The Base Variety Mo\5 


Let 


1 
Mos = Spec fut s(s— Dt -—DG- 7 


be the moduli space of five distinct ordered points in the projective line. The 
description on the right arises because the five points can be normalized to 0, 1, 
oo, s, t by a unique fractional linear transformation. 

A naive completion of Mo,5 is Mos = Pp! x Pl. The top subfigure in each column 
on Fig. | gives a schematic representation of the real torus Mo,5(R). As usual, one 
should imagine the subfigure inscribed in a square, with the torus obtained by 
identifying left and right sides, and also top and bottom sides. Here and in the rest 
of Fig. 1, coordinate axes are distinguished by darker lines and lines which are at 
infinity in our particular coordinates are indicated by dotting. 

A more natural completion Mos of Mos is obtained from blowing up Mos 
at the three triple points (0,0), (1,1), and (co, co). The natural action of Ss on 
Mos extends uniquely to Mos. Reflecting this equivariance, lines in Mos — Mos 
are naturally labeled by two-element subsets of {0, 1, 00,5, t}. Another reflection 
of equivariance is that elements of {0, 1, 00, 5, t} index fibrations over genus zero 
curves. The fibrations p, and p; are projections to the t and s axes respectively. The 
smooth fibers of po and p; are the lines going through (1, 1) and (0, 0) respectively 
of slope different from 0, 1, oo. The smooth fibers of poo are certain hyperbolas 
going through both (0, 0), (1, 1). Note how each fibration partitions the ten lines of 
Mos — Mp, into four sections and six half-fibers, the latter coming in three pairs to 
form the singular fibers. 

Consider (01), (st), and (Oloo) in their action on Mos. The group $3 x S> that 
they generate acts on the naive compactification Mo,5. This action can be readily 
visualized in terms of our pictures of Mo5(IR): (01) is a half-turn about the point 
(1/2, 1/2), (st) is a reflection in the diagonal line, and (0100) is a simultaneous 
one-third turn of the coordinate circles P! (IR) and P!(R). 
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ty : to 


Fig. 1 Base varieties, ramification divisors, and associated conjugacy classes 
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3.2.2 Quotients of Mo,s 


Figure | schematically indicates five planes, each with their own coordinates, 
as indicated by axis-labeling. The maps between these planes have the degrees 
indicated in Fig. 1, and are given by the following formulas: 


_— (i -)s (1 — 1)? Phe — 458 _ ae 
(a) = (Ga Oe), (u, v) =((s2 to)”, (82 + to 1)’), 


3Qu—v+1) 3uu—v+2)) (, y_ (a768e 9A 
a, b) = |. 
(u—v+2)?’? Qu—v+1)P?/ W2 °>W 


va = ( 


Here A = uw? + v* + 1 — 2u— 2v — 2uv is a quantity which will play a recurring 
role, while W = u? — 10uv + 6u + 9v? — 18v + 9 is a quantity which appears 
explicitly here only. Two moduli interpretations of (u, v), identifying U with U21.11 
and U 1.2 respectively, are given in (8) and (9) below. The moduli interpretation 
of (p,q) appears in (6) and (7) below. The moduli interpretation of (a,b) is less 
direct, but arises from the relation (5) below. The four maps displayed above are 
consequences of these moduli relations. 

Our considerations are mainly birational, and so it is not of fundamental 
importance how we complete the various planes. As the diagrams indicate, three 
times we complete to a product P! x P! of projective lines, while twice we complete 
to a projective plane P?. We are starting with two copies of the same variety, with 
U ; 5 having coordinates s; and t;. The other varieties are quotients: 


U = Ujs/((O1)), U = Ujs/((01), (st)), 
U31,1 = Uls/((01), (0100), U32 = U;;/((01), (100), (s?)). 
Blowing up some of the intersection points would yield more natural completions, 


but we will not be pursuing our covers at this level of detail. 
The natural double cover U3,1,; — U3,2 is given in our coordinates by 


(a,b) = (pq — 6pq + 4p + 4q — 3, pq). (5) 
Inserting this map on the bottom row of Fig. 1 would of course make the bottom 
triangle not commute, as even degrees would be wrong. Because of this lack of 


commutativity, the behavior of X; over curves and points in Fig.5 is not directly 
related to the behavior of X2 over the pushed-forward curves and points in Fig. 6. 


3.2.3. Covers of Mo\s 


The five rigid tuples of Proposition 3.1 enter Fig.1 through our associating 
conjugacy classes in I" to lines. On the top-left subfigure, from any fixed choice 
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of s € C — {0,1} one has a cover of P!(C) ramified at 0, 1, 00, and s. The local 
monodromy classes associated to moving in a counter-clockwise loop in the t-plane 
about these singularities form the ordered quadruple (3A, 3A, 3A, 4B). On the top- 
right subfigure they form (4A, 4A, 4A, 2A). 

But now by rigidity one has local monodromy classes associated to all ten lines of 
Mos — Mos. Using the monodromy considerations of [14], we have computed these 
classes. The classes are placed in the top two subfigures of Fig. 1. Interchanging the 
roles of s and t, one sees that the cover of Mo,s indicated by the top-left subfigure also 
arises from (4A, 4A, 4A, 4B). However the top right cover now just arises in a new 
way from the original quadruple (4A, 4A, 4A, 2A). Via any of the three remaining 
projections 9, P1 Poo, the covers represented by the top-left and top-right subfigures 
arise respectively from (2A, 2A, 3A, 4A) and (4A, 4A, 3A, 3A). 


3.2.4 Descent to Covers of U3 1,1 and U3,2 


The labeling by conjugacy classes on both the top-left and top-right copies of Mo,5 
is visibly stable under the action of $3; = ((01),(Oloo)). Moreover on the top- 
right, the labeling is also stable under the diagonal reflection (st). One therefore has 
descent, to a cover mz; : X; — U3, on the left and a cover m2 : X2 — U3 on the 
right. 


3.3. Summarizing Diagram 


We now shift attention from Figs. 1 to 2. The lowest varieties U3,1,1, U3,2 and their 
common cubic covering by U from Fig. 1 are redrawn in the left part of Fig. 2. 
The two copies of Mo,5 from the top of Fig. 1 now play a secondary role and are 
suppressed. In their place, the degree 28 coverings X, and X> discussed above are 
now explicitly indicated. Also Fig. 2 contains their common base change to Xp —> U. 


Fig. 2 Left: the covers 7 Xo Q(xo,yo) 
and 7, as related by the Dy Do 
cover 1. Right: ral Se 
corresponding function fields Xi Xo Q(#1,y1) Q(ax2,y2) 
TO 
1 m2 


U Q(u,v 


Zs NAS 


U3,151 U3,2 Q(p.¢) Q(a,b) 
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The left part of Fig. 2 commutes, and so the upper maps 7, like the lower maps 
o; from Fig. 1, have degree three. Note that while the top-left part of Fig.2 has 
been canonically defined, we do not yet have an explicit description for any of the 
surfaces X; or maps ;. We do not yet have an explicit description of the vertical 
maps 7; either. In particular, we have not yet discussed the coordinates x;, y; from 
the top-right part of Fig. 2. 


4 3-Division Polynomials of Deligne-Mostow Covers 


Here we first recognize 7; : Xj — U3, and m2 : X2 — U3 as associated to 
3-division points on certain Deligne-Mostow covers. Using this connection, we 
compute zt directly and then deduce explicit formulas for zp and z,. The last 
section calculates some sample L-polynomials and illustrates how their mod 3 
reductions are determined by our equations for the 77;. 


4.1 Local Monodromy Agreement 


Deligne and Mostow’s treatises [4, 5] concern curves presented in the form y” = 
f (parameters, x) and the dependence of their period integrals on the parameters. 
Their table in Sect. 14.1 of [4] has 36 lines, each corresponding to a family. Their 
lines 3 and 2, written using our parameters p and q on U3, ), are 

y4 


4 
y 


II 


x° (px? + 3x° + 3x4+q), (6) 
x(px? + 3x7 + 3x + q)’. (7) 


In both cases, the complex roots of f(x) are the three roots a1, a2, and a3 of 
px? + 3x? + 3x + q and a4 = 0. A series solution for each equation in the variable 
x—Q4 = xis 


3x 27x? 3x 9x? 
page (1+ pate): y= aia (14 mo Gat)- 


The important quantity for us is the leading exponent associated with a4, namely 
flag = 1/2 and 4 = 1/4 in the two cases. Similarly, expanding in the local 
coordinates x — a; fori € {1,2,3}, one has ~; = fz = ps. In the two cases, 
these exponents are 1/4 and 1/2 respectively. 

Corresponding to the title of [5] containing just PU(1, n) rather than more general 
PU(m,n), Deligne and Mostow are interested in the case when the sum of the ju; 
corresponding to roots of f(x) is in (1, 2). A leading exponent at ov, here jus, is then 
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defined so that the sum of all jz; is 2. So, summarizing in the two cases, the exponent 
vector is 


l1lii11 
(M1, 12,3, 64. M5) = (7.7.7, 5,7), (i. M2, 3,45) = (5,5.5.7.7]- 


These are the quantities actually presented on lines 3 and 2 of the Deligne-Mostow 
table. From j44 = [Us one has descent from U3; to U3,2 in the second case, but not 
the first. 

Switch notation to (U9, [41, (Loo, [ts, [4r) to agree with the previous section. The 
local monodromies about the divisor of Dj, as classes in GL3(C), are represented by 


10 0 
m= | 0 1 1 
0 0 exp(27i(uj; + Lx) 


Here the off-diagonal 1 can be replaced by 0, except in the case uj + py € Z, Le. 
Mj + Me = 1. 

Global monodromy is in fact in a unitary subgroup of GL3(Z[i]). The matrix myx 
has infinite order if 4;+ 4, = 1, and otherwise has the finite order denom(4j+/4x) € 
{2, 4}. Reducing to PU3(F3) C PGL3(Fo), the infinite order m, acquire order 3 and 
the finite order mj, maintain their order. Moreover, not just the orders but even the 
conjugacy classes can be shown to agree with those presented at the top of Fig. 1. 
Thus the rigid covers of the previous section are realized as 3-division covers. 


4.2 Explicit Equations 


The following theorem gives equations describing the three covers mo, 7, and 7. 
A preliminary comment about the contrast between curves and surfaces is in order. 
Requiring automorphisms to fix C pointwise, Aut(C(x)) is just PGL2(C) while 
Aut(C(x, y)) is the infinite-dimensional Cremona group. A consequence is that any 
given F : P! — P! is already in a good form. Furthermore, one has only a total 
of six degrees of freedom in adjusting domain and target coordinates in order to 
get a particularly nice form, like that of the Malle-Matzat cover. However a given 
F : P? — P? may be in far from best form, and adjusting coordinates to improve 
the form seems to be more of an art than a science. The theorem gives the best form 
we could find in each case, but does not exclude the possibility that there are more 
concise forms. 


Theorem 4.1. The surfaces Xo, X\, and X» are all rational. There are coordinate 
functions (x;, yi) on X; so that the top five maps in the left half of Fig. 2 are as follows: 
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The three covers with domain Xo. Abbreviate (x, y) = (xo, yo) and 


= 15x? — 4yx —4x+ 5, 
86a = Oxy’ + yy? + 18xy — 18y — 66x + 6, 
86p = 225x° — 30yx — 30x — 2y’ + 6y + 33, 
81q = Syx* + 65x? — 2y’x — yx — 2x + Sy +5, 
81h = 45yx* — 105x + 6y?x — 8yx — 14x — Sy —5, 
go = 225x° — 30yx* — 105x” + y’x + 22yx + 21x + y’ — 8y—9, 
S10 = 225x7y* + 1200x7y + 2850x7 + 250xy" — 1500x + 2y* + 8y? + 37y" 
—192y + 402, 
817 = 2025x*y + 24300x7y + 39150x7 + 540x7y? + 1845x7y — 180x7y 
—29610x? — 18xy* + 168xy? — 213xy? — 252xy + 9522x + 10y4 
+60y? — 105y? + 900y — 2070, 
21g = 50625x° — 3375yx* + 30375x4 — 675y7x> + 2025yx? + 2700x7 
+75y*x? + 2025yx* + 5850x7 — 2y*x — 18y3x + 33y2x — S13 yx 
+63x + 15y> — 30y” + 270y + 315. 


Then X, 1, and Xy are given by 


817 45 g10(x + 1) (9x? — 2x + 1) 
=") ’ Y1 = ’ 
86a8 6b 848 6a8 6b 
a goa" +1) a2 8198489 
259? 97, (9x2 — 2x + 1)’ 2592 97, (9x2 — 2x + 1)’ 
aii 1 yo = — 848% 
x41? Sem(x + 1)?" 


The cover 7. Abbreviate (x, y) = (x1, y1) and 


hy = 20° + 4x4 — 12x*y + 8x3 + 9x7y + 16x? — 24xy + 8x + 3y’ 
+18y + 16, 

hog = 4x! + 48x°y + Tx® — 72x? y? + 24x? + 48x4y? — 63x4y? + 288x4y 
+42x4 — 12x73 y4 — 288x3y? + 48x3 — 3x24 + 192x7y? — 252x7y? 
+576x7y + 84x? — 24xy4 — 288xy? + 32x + 3y° — 6y* + 192y° 
—252y* + 384y + 56. 
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Then x is given by 


326 3huy (3x2 — y? +6)" 
— qd => he é 
26 
The cover 7. Abbreviate (x, y) = (x2, y2) and 
fo = 144x7y — 408x7y — 12x? + 8xy? + 388xy + 20x — Oy” — 126y— 9, 


fia = 36x? — 288x7y" — 504x3y + 816x7y? + 1236x7y — 12x” + 2xy? 
—840xy? — 1038xy + 20x — 9y? + 297y? + 297y — 9. 


Then m2 is given by 


aye ; 36x4y? 
a= 7 = 5 
fe, (12x2 — 20x + 9) fis 


Proof. We will sketch our construction only, as there were many complicated 
variable changes to reduce to the relatively concise formulation given in the 
theorem. We first found z as follows. Via (7), the genus three curve Y3(p,q) is 
presented as a quartic cover of Pl, Replacing x by 7” in (7) and factoring, one gets a 
presentation of Y2(p, q) as a double cover of the ¢-line P!: 


y =pt'+3P +3 + qt. 


Three-torsion points on the Jacobian of Y2(p,q) are related to unramified abelian 
triple covers of Y2(p,q). Such a triple cover arises as a base-change of certain 
ramified non-abelian triple covers of P?. 

Consider now a partially-specified triple cover of P/, given by 


(at + wot (bt-Ap)2 + te +Hz+t(d—1 =0. 


The discriminant of this polynomial with respect to zis a septic polynomial in f with 
zero constant term. Setting it equal to k(pt’? + 3 + 3° + qt) imposes the necessary 
ramification condition. It also gives seven equations in the seven unknowns a, b, c, 
d, A, 4, and k, all dependent on the two parameters p and q. 

The equations corresponding to the coefficients of 7”, t*, and ¢° let one eliminate 
d and yz and reduce the remaining equation to 


—9a7h? + 162070 — 729a2 + 18abA2 + 180abA — 486ab + 120acda 
—216ac + 3b7A? + 34b7A + 27b* + 24bcA + 72bc + 32c? = 0. 


The system consisting of this equation and the equations coming from the coeffi- 
cients of f!, 3, , and #’ is very complicated to solve. Nonetheless, one can eliminate 
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all the remaining variables, at the expense of putting in the new parameters x2 and 
y2. Conveniently, p and q enter the final formulas symmetrically and can then be 
replaced by a and b via (5), yielding our presentation of 2. 

Our formula for 9 was then obtained via base-change. To get 7, we first built a 
double cover Xp of Xo which is a Galois sextic cover of the yet-to-be explicitized X,. 
Then we explicitized X; by taking invariants under the Galois action, X; = X /S3. 

Oo 

There is a standard way to pass from bivariate rational functions as in the theorem 
to univariate polynomials which are more traditional in number theory. Namely, 
suppose given a cover of rational surfaces via say u = A(x, y)/B(x, y) and v = 
C(x, y)/D(x, y). Assuming x indeed generates the field extension, one can express 
the cover in terms of x alone via a resultant 


F(u, v, x) = Res,(A(, y) — uB(x, y), C(x, y) — vD(x, y)). 


Carrying this out in our context gives Fo(u,v,x), Fi(p,q,x) and F(a, b,x). 
Expanded out, they have 1606, 772, and 209 terms respectively. Interchanging the 
roles of x and y, one gets polynomials Go(u, v, y), Gi(p, g, y), and G2(a, b, y) with 
4941, 1469, and 951 terms respectively. In general, keeping either just x or just y 
is unlikely to minimize the number of terms. More likely the minimum can only 
be obtained by keeping some third variable z € Q(x, y). There do not seem to be 
standard procedures to find these best variables. 


4.3 L-Polynomials of Deligne-Mostow Covers 
and Their Reduction Modulo 3 


To explicitly illustrate the 3-division nature of the main polynomials F)(p, q,x) and 
F(a, b,x), we pursue the polynomial Fo(u,v,x) describing their common base- 
change. Cubically base-changed to the u-v plane, the Deligne-Mostow covers in 
question after some twisting become as follows: 


Y,(u,v): vyt =x? (e— 13 (ux? + (1—u—v)x +n) (genus 4), (8) 


2 
Yo(u,v): 4yt= ( +2x4+1— =) ( =2c41= ~) (genus 3), (9) 
VU 0) 
E(uv):) y =(@«—Dx + d—u—v)x4+u) (genus 1). 


The quadratic subcover of Y,(u, v) is the elliptic curve E(u, v) while the quadratic 
subcover of Y>(u, v) has genus zero. 

Our monodromy considerations give a relation between Y,(u,v) and Y>(u, v). 
The twisting factors v and 4 in the equations above are included so that we can give 
a clean statement of this relation on a more refined level: 


L,(Y(u, v), x) = L,(¥2(u, v), x)Lp (E(u, v), x). (10) 
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Here uw and v are rational numbers and p is any prime good for all three 
curves. Each L-polynomial L,(Y,x) is the numerator of the corresponding zeta- 
function ¢,(Y,x), obtained by determining the point counts |Y(F,,)| for f up 
through genus(Y). Our computations below obtain this L-polynomial via Magma’s 
command ZetaFunction [2]. 

The factorization (10) has the following explicit form: 


L(Y (u, v), x) = (l+.ax + bx? + ce? + pbx! + p’ax + p*x®)(1 + dx + px’). 


For p = 1 (4), both factors in turn split over Q(i) as the product of two conjugate 
polynomials. For p = 3 (4), the coefficients a, c, and d all vanish, so that each factor 
is an even polynomial. Taking (u, v) = (—4, —3) as a running example, these two 
cases are represented by the first two good primes: 


Ls(¥i(—4, —3),x) = (1 — 2x? — 16x* — 5x4 + 125x°) (1 — 2x + 52”), 
= M(1 + ix— (1 — 213° — (10 + 5i)x*) NU — (1 + 2i)x), 
L(Y (—4, —3), x) = (1 + 5x7 + 35x* + 343x°) (1 + 72°). 


Here and below, N(f) = ff is the product of a polynomial f and its conjugate f. 

Consider now L,(Y2(u, v),x) = N( + ax + Bx? + yx?) in F3[x] for varying 
p = 1 &). To twist into a situation governed by SU3(3) we eplere x by —y>x to 
obtain the modified polynomial L, (Y2(u, v),x) = N(—ay?x+ By* x°—x3) € Fs[x]. 
Similarly, for p = 3 (4), consider L,(Y2(u, v), x) = 1+bx?+bpx ‘tps im F;[x]. To 
twist into a situation governed by S$ U,(3). 2—SU3(3), we replace x? by px” obtaining 
L,(Yo(u, v),x) = 1 + bpx? + bpx* + x® € F3[x]. For 5 < p < 97, the polynomials 
L(Y (—4, —3), x) are calculated directly by ZetaFunction to be 


Class(p) |A28(p) L,(Y2(—4, —3),x) € Fs[x] Primes p 

3B 3°1  N(L— 2°) 89 

TAB 7* —N(1—(L + Det (1 — i)x? — x3) 5, 13,29, 53, 61, 73, 97 
8AB 832 12|N N(1 —ix—ix 2 — x?) 37,41 

12AB_ |12731 Ml + (1=)x= (1 + is? — x3) 17 

6b 6431 |(1+2)° 11,19 

8c 834 (1 + x) (1 +2x+ 2x") (1 +x+ 2x) 43,67, 79, 83 

12c, 12d|1273 1|(1 + x)?(1 + 2x)? (1 +2’) 7,23, 31,47, 59, 71, 


For general (u, v), the fact that Fo(u, v,x) functions as a 3-division polynomial is 
seen by the fact that L,(Y2 (u,v), x) € F3[x] depends only the conjugacy class in I.2 
determined by p. Up to small ambiguities, as described in Table |, this conjugacy 
class is determined by the class of p modulo 4 and the factorization partition A23(p) 
of Fo(u, v, x) € F,[x]. 


186 D.P. Roberts 
5 2-Division Polynomials of Shioda Quartics 


In this section, we recognize m, : X; > U3, and m2 : X2 — U3 as 2-division 
polynomials for certain genus three Shioda curves. The last section calculates some 
sample L-polynomials and illustrates how their mod 2 reductions are determined by 
our equations for the 7;. 


5.1 The Shioda W(E7)* Polynomial 


In [15], Shioda exhibits multiparameter polynomials for the Weyl groups W(E¢), 
W(E7), and W(Eg). He proves in Theorem 7.2 that these polynomials are generic, in 
the sense that any W(E,,) extension of a characteristic zero field F is given by some 
specialization of the parameters. 

The case of W(E7) & W(E7)t x Cp is explained in greater detail in [16] and goes 
as follows. Fix a parameter vector r = (1,13,174,15,16,17,19) € C’ and consider 
the equation 


y =xy+ (w? + raw + r6)x + (r;w* + rw + rsw + rywt ro). (11) 


The vanishing of the right side defines a quartic curve Q, in the w-x plane. The 
equation itself defines a K3 surface in x-y-w space mapping to the w-line with elliptic 
curves as fibers. Now consider the substitutions 


x=wtb, y=cw'+dwte, 


which make each side of (11) a quartic polynomial in w. Equating like coeffi- 
cients, (11) then becomes five equations in the five unknowns z, b, c, d, and e. 
There are 56 solutions, paired according to the negation operator (z, b,c, d,e) 
(z, b, —c, —d, —e). Much of the interest in Shioda’s theory comes from regarding 
these solutions as generators for the rank seven Mordell-Weil group of the generic 
fiber. 

Our interest instead is that the 28 lines x = zw+b are exactly the 28 bitangents of 
Q,. The variables b, c, d, and e can be very easily eliminated and one gets Shioda’s 
degree 28 generic polynomial for the rotation subgroup W(E7)*: 


S(7,z) = 
28 Bri 227 4 727032 + 60rgz4 + (—504rs + 432ryr4)e? + 
(38477 rq — 1248rirs + 540r3 — 540r6)z? +--+: 


Expanded out as an element of Z[r, z] := Z[r1,13,14,15,16,17,19, Z|, there are 1784 
terms. The polynomial is weighted homogeneous when the variable z is given 
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weight | and each parameter 7; is given weight i. The polynomial discriminant of 
S(r, z) factors over Q as A(r) = D(r)C(r)’, with D(r) a source of ramification and 
C(r) an irrelevant artifact of our coordinates. 


5.2 Using I'.2.C W(E7)t 


The group I°.2 is a subgroup of W(E7)* . Since genericity implies descent-genericity 
[11], any degree 28 extension K/F with Galois group I.2 is of the form F[x]/S(r, z) 
for suitable r € F’. For the Malle-Matzat polynomial m(t, x), we considered various 
t € Q and conducted a very modest search over different polynomials of small 
height defining the same field as Q|x]/m(t, x). For a few t, we found a polynomial of 
the form S(r, z) for certain r € Q’. Some of these seven-tuples had similar shapes, 
and interpolating these only we found that the Malle-Matzat family seemed also to 
be given by 


S(0, —2777, —8127, 2438, 2437, —7292*, 7297°, z) = 0. (12) 


The correctness of this alternative equation is algebraically confirmed by eliminat- 
ing ¢ from the pair of Eqs. (3), (12), to obtain the relation 


(x—1)/ (ae + 20x37 + 114x? + 68x + 13). 
(x° — 6x5 — 435x4 — 308x3 + 15x? + 66x + 19)” 
243 (x2 + 4x + 1)8 


z= (13) 


Thus Eq. (12) realizes the Malle-Matzat polynomial as a 2-division polynomial for 
an explicit family of genus three curves. 

The simplicity of the equational form (12) is striking, especially taking into 
account that all the positive integers printed are powers of 3. Expanding the family 
out as a polynomial in Z[t, z] hides the simplicity, as there are 75 terms. 


5.3. A Search for I.2 Specializations 


Given the simplicity of (12), we searched for similar families as follows. We 
considered one-parameter polynomials of the form S(r,z) with r; = a;f*'. Here 
the e; € Zso are fixed and the constants a; yet unspecified. We looked at many 
(€1, €3, €4, €5, €6, €7, €9) hear-proportional to (1, 3,4, 5, 6, 7,9) so as to ensure that 
D(a,t*!,..., dot®?) has the form t“d(t) with d(t) of small degree. When a particular 
exponent e; made a proportionality (e€],...,é@9) « (1,...,9) not so close, we set a; 
equal to zero, rendering e; irrelevant. 
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We then worked modulo 5, letting (a),..., a9) run over relevant possibilities in 
Fi. If k of the a; are set equal to zero, we looked at just 4°" possibilities: we kept 
the other a; nonzero, and homogeneity and the scaling t +> ut each saved a factor of 
4. We examined each one-parameter family S(a,t*!,... , aot’, z) by specializing to 
t € F. and factoring in F,j[z]. In the rare cases when all factorization patterns A. 
for j = 1, 2, and 3 corresponded to elements of [.2, as on Table 1, we proceeded 


under the expectation that S(a)f',...,aof,z) = 0 defines a cover with Galois 
group in [°.2. 

For 15 (e),...,@9) we found exactly one (a},...,d9) which works. For five 
(€1,...,@9) we found several (a},...,@9) which work, suggestive of a two- 
parameter family. We then reinspected these five (e1,..., e9) in characteristic seven, 


imposing also that the covers sought be tame. The case (e1, €3, €4, €5, €6,€7,€9) = 
(0,1,1,*,2,2,2) seemed to give a two-parameter family in both characteristics, 
satisfying the tameness condition at 7; here the * means that we are setting as = 0. 
Standardizing coordinates, the two-parameter families seemed to match well, and 
there remained the task of lifting to characteristic zero. 

We first found that S(1,0, 3¢,0,0,0,—??,z) € Ql[t,z] defines a 3-point cover, 
giving us hope that coefficients might be even simpler than in (12). Finally we found 
a good two-parameter family So(u, v, z) = 0 where 


So(u, v,z) = S_,u—v +1, —3u, 0, u(—u+ v—1),u(—u+v—1),-wv,27. (14 
The discriminant of So(u, v, z) is 
D(u, v) = 2716318 yy? y4 (uy? — 2Quv — 2u + v? — 2v + 1)’ 


times the square of a large-degree irreducible polynomial in Z[u, v]. 


5.4 Explicit Polynomials 


Our computation of So(u, v, z), as just described, is completely independent of the 
considerations of the previous section. In fact we found So(u, v, z) before we found 
its analog Fo(u,v,x) from the previous section. It might have been possible to 
directly descend So(u, v, z) to Si(p, g, z) and S2(a, b, z) below. However instead we 
obtained these new S; from the corresponding F;: we took lots of specialization 
points, applied Pari’s polred to obtain alternate polynomials, selected those that 
are of the form S(r1, 73, 74, 7s, 76,17, 19, Z), and interpolated those that seemed to fit 
a common pattern. 


Theorem 5.1. Abbreviate d = p*q? — 6pq + 4p + 4q — 3, A = 256/a, and B = 
(b—1)/8. The covers mo, 1, and m2 are also given respectively via the polynomials 
So(u, v, z), 
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0, 
ap, 
3d*p*(q — 1), 
Si(P,4,2) = S( 3d p*.2); 
—d’p’ (3p°q? — 9pq + 4q + 2p). 
—3d'*p*(q— 1), 
ap (2pq° —3q+ 1) : 
1, 
3 (AB? +2), 
—3 (8AB? + AB+ 1), 
So(a, b,z) = S( —3 (SAB? + AB—4),2). 


—8A?B* — A?B? — 184AB? — 31AB—A —2, 
—56A2B* — 7A2B? — 199AB? — 58AB — 4A + 10, 
—440A2B* — 103A2B3 — 6A2B2 — 693AB2 — 183AB — 12A 4+ 3, 


Proof. We describe Case 0, as the other cases are similar except that the analog 
of (15) is much more complicated. Analogously to (13), one needs to find z in the 
function field Q(x, y) of Xo satisfying So(u, v, z) = 0. To find a candidate z, one takes 
a sufficiently large collection of {(x;, y;)} of ordered pairs in Q?. One next obtains the 
pairs (u;, v;) = m0(x;, y;). Discarding the very rare cases where So(u;, v;,z) € Q|z| 
has more than one rational root, one defines z; to be the unique rational root of 
So (ui, Vi, Z). The desired z is then obtained by interpolation, being 


_ Bx= Nga (3x — 1) (Oxy? + 18xy — 66x + y? — 18y + 6) iS 
gem 7 225x? — 30xy — 30x —2y?+ 6y +33 


Correctness is confirmed by verifying that So(u(x, y), v(x, y), z(x, y)) indeed simpli- 
fies to zero in Q(x, y). Oo 

Fully expanded out, So(u, v, z), Si(p, q, Zz), and S2(a, b, z) respectively have 551, 
7299, and 1053 terms. Thus given Shioda’s master polynomial S, our S; admit the 
relatively concise presentations given in (14) and Theorem 5.1. Without S, the new 
S; are of comparable complexity to the previous F;, in the sense of number of terms. 


5.5 L-Polynomials of Shioda Quartics 
and Their Reduction Modulo 2 


To illustrate the 2-division nature of the polynomials So(u, v,z), S\(p,q,z), and 
S(a, b,z), one could take any parameter pair for which the corresponding poly- 
nomial is separable. As in Sect. 4.3, we work with (u, v) = (—4, —3). 
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The images of (u, v) in the lower planes are (p, g) = 0, (—4, —3) = (—12, —3/4) 
and (a,b) = o2(—4,-3) = (192,9). By plugging into the three parts of 
Theorem 5.1, and scaling by r; +> r;/9! in the middle case, one gets indices 


b(-4,-3) = (1, 0, 12, 0, 0, 0, —16), 
I,(—12,-3/4) = (0, -12, -84, -144, 720, -1008, 7872), 
1(192,9) = (1, 10, —39, —12, —306, —450, —2157). 


Taking these vectors as (11, 73,14,15,16,17,19) and substituting into the right side 
of (11), one gets three quartic plane curves, to be denoted here simply Qo, Qi, 
and Q>. 

As in Sect. 4.3, each of the genus three curves Q; has good L-polynomials 


L,(Qi,x) = 1 + ax + bx + cx? + pbxt + pear + px, 


Using Magma’s ZetaFunction again, and taking the first two good primes in 
each case, one gets 


L5(Qo,x) =14tx+3rP 44-5, L1(Qo.x) =1-xt+4P-1Ie +e, 
L5(Q\,x) =14+x+3r 4042+, L7(Q1.x) =1-x4+ 8-4-5, 
Ls(Qo,x) =1l4+x4+x2 4118 4+---, L7(Qo,x) =1—x4+8P—04+---. 


One has coincidences L5(Qo,x) = Ls(Q,x) and L7(Q),x) = L(Qo,x), with the 
second polynomial being reducible: (1 — x + 7x*)(1 + x* + 49x+). The generic 
behavior is that all three L,,(Q;, x) are different and their splitting fields are disjoint 
extensions of Q, each with Galois group the wreath product S> 2 S3 of order 48. 

The behavior of the curves here differs sharply from the behavior of the curves in 
Sect. 4.3. To describe this difference, we will use the language of motives, referring 
to the unconditional theory of [1]. Note however, that the language of Jacobians 
would suffice for the current comparison. Similarly, one could use the alternative 
language of Artin representations for Sect. 6.3. But for uniformity, and certainly to 
include the general case as represented by Sect. 6.4, the language of motives is best. 

The difference between they Y; of Sect. 4.3 and the Q; here goes as follows. The 
two curves Y; from Sect. 4.3 give rise to a single rank six motive M = H!(¥2,Q) C 
H!'(¥;, Q). Moreover the potential automorphism (x, y) +> (x, iy) causes the motivic 
Galois group of M to be the ten-dimensional conformal unitary group CU3.2. In 
contrast, the motives M; = H'(Q;, Q) here are all different, as is clear from their 
different L-polynomials. Moreover, their motivic Galois groups are all as big as 
possible, the full 22-dimensional conformal symplectic group CSpe. 

While the different L,(Q;,x) € Z[x] have very little to do with each other, their 
reductions to F,[x] coincide, as illustrated with primes 5 < p < 97: 
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Class(p) Ag (p)|Lp(Qi,x) € Fob] Primes p 
3B. (31 [x +02(2 +241) [89 
7AB 7 (x3 +x+ 1) OF x 4 1) 15,13, 29,53,61,73, 97 


8AB 8321? \(x +. 1) 37, 41 

12AB 12231 (x? +x+1)° 17 

6b 6431. (x + 12(2@ 4x41) [11,19 
884 (x + 178 43, 67, 79, 83 

12c, 12d/12?31 |(x? +x+1)° 7,23, 31,47, 59, 71 


This table shows very clearly how So(—4, —3, z) functions as a 2-division polyno- 
mial. All three S;, arbitrarily specialized, similarly capture the mod 2 behavior of 
corresponding L-polynomials. 


6 2-Division Polynomials of Dettweiler-Reiter G2 Motives 


This section explains how the cover 2; : X; — U3,,; is related to rigidity in the 
algebraic group G2 in two ways. The last section presents some sample analytic 
calculations with L-functions. 


6.1 Rigidity in General 


In the mid 1990s, Katz [10] developed a powerful theory of rigidity of tuples 
(g1,..-,&z) Satisfying g;---g, = 1 in ambient groups of the form GL,(E), with E 
being an algebraically closed field. There is presently developing a theory of rigidity 
of tuples in G(E) for other ambient algebraic groups G; particularly relevant for us 
is [6], where G is either Gz or SO7. In general, if G is simple modulo its finite center 
we say that a tuple (C),..., C,) is numerically rigid if 


y edg(Ci) = (z— 2) dim(G). (16) 


i=1 


Here for C; a conjugacy class containing an element g;, the integer cdg(C;) = 
cdg(g;) is the dimension of the centralizer of g; in G(E). 

The Malle-Matzat case provides a convenient example in Katz’s original context. 
As explained in [13, Sect.8], after a quadratic base change the class triple 
(4b, 2b, 12AB) becomes (12A,2A,12B) in = SU3(F3). Pushed forward to 
SL3 (F3), the classes 12A and 12B are regular and so have centralizer dimension 
rank(SL3) = 2. The class 2A is a reflection and has centralizer GL2(F3) with 
dimension 4. The rigidity condition (16) becomes 2 + 4+ 2 = 1 - 8 and is thus 
satisfied. 
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-2 


Fig. 3. The image of the class-set G3(2)" inside the class space G 


6.2 Groups G2(2) and G2-Rigidity 


Our group J.2 = G2(2) embeds into the fourteen-dimensional compact Lie group 
G;. Figure 3 illustrates the associated map Gz (2)5 > GS on the level of conjugacy 
classes, which is no longer injective. The fundamental characters x and ¢ of G2 
have degrees 7 and 14 respectively, and the set Gt becomes the indicated triangular 
region in the y-¢ plane. The unique class in G2(2)! which is outside the window is 
the identity class 1A at the point (7, @) = (7, 14). 

We have drawn Fig. 3 to facilitate the analysis of rigidity in G(C). First consider 
classes which intersect the compact group G5, and are thus represented by points 
in the closed triangular region drawn in the figure. If g represents one of the vertex 
classes labeled by 1A, 3A, and 2A, b respectively, its centralizer has type G2, SLs, 
and SL, x SL, thus dimension 14, 8, and 6 respectively. For g representing a class 
otherwise on the boundary, the centralizer has type GL and hence dimension 4. For 
g in the interior, the class is regular and so the centralizer dimension is rank(G2) = 
2. For general semisimple elements in G2(C) the situation is the same: centralizer 
dimensions are 14, 8, and 6 for the three special classes already considered, 4 for 
classes on the algebraic curve corresponding to the boundary, and 2 otherwise. 


6.3 Gy-Rigidity of (3A, 3A, 3A, 4B) 


The first-listed quadruple for 7; in Proposition3.1 is (3A, 3A, 3A, 4B). Using the 
determinations associated to Fig. 3, the left side of (16) becomes 8+8+8+4 = 28 
which agrees with the right side (4 — 2)14 = 28. Thus (3A, 3A, 3A, 4B) is G-rigid. 
We are not pursuing this connection here, but it seems possible to write down a 
corresponding rank seven differential equation with finite monodromy. From the 
algebraic solutions to this differential equation, one could perhaps construct the 
cover X; > U3; ina third way. 
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6.4 Orthogonal Rigidity of a Lift of (2A, 2A, 3A, 4A) 


The last-listed quadruple for mz; in Proposition3.1 is (2A, 2A, 3A, 4A). This tuple 
fails to be G2-rigid, as now the left side of (16) is 6 + 6+ 8 + 3 = 24 which is less 
than 28. However one does have rigidity of a lift as follows. 

The following matrices were sent to me by Stefan Reiter in July 2013. 


1 1 1 
1 1 
1 1 1 
a=|-3 1 1 ~ 1 
3 -1 1 1 
9 -3 1 1 
1 3-12-11 1 
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These matrices satisfy abcd = | and they generate a subgroup of GL;(C) with 
Zariski closure of the form G(C). On the one hand, reduced to GL7(F)), these 
matrices generate a copy of G(2)’ with a, b, c, and d respectively in 2A, 2A, 3A and 
4A. On the other hand, considered in GL7(C), the matrices have Jordan canonical 
forms as listed on the right, with @ = exp(2zi/3). 

Consider a,b,c,d € Gy(C) C $O7(C) C SL;(C). Centralizer dimensions are 
calculated in [6, Sect. 3] and the numerics associated with (16) are as follows. 


G \cdg(a) + cdg(b) + cdg(c) + cdg(d) 2 dim(G) 
G) 8 + 8 + 8 + 4 = 2 = 28 
So, 13 + 13 + 9 + 7 = 42 = 4 
SL; 28 + 28 + 28 + 16 = 9 < 9% 


Thus the quadruple ([a], [b], [c], [d]) is G2(C)- and SO7(C)-rigid. However it is not 
SL7(C)-rigid, and so does not fit into Katz’s original framework. 

Dettweiler and Reiter classify tuples of classes in G2(C) which are SO7(C) rigid 
in [6]. Thus ({a], [b], [c], [d]) is in their classification. In fact, it appears as the first 
line of the table in Sect. 5.4. Being SO7(C)-rigid is a stronger condition than being 
G2(C)-rigid. It implies from [6] that there is a corresponding rank seven motive over 
Q(p, g) with motivic Galois group G2. 


6.5 Division Polynomials and L-Functions 


In Sects. 4.3 and 5.5 we have discussed L-polynomials L,(M, x) for certain motives 
M = H'(curve,Q). Putting these L-polynomials together, including also L- 
polynomials at bad primes, one gets a global L-function 


L(M,s) =| ][L,(.p~)'. (17) 
Pp 


This L-function is expected to have standard analytic properties, including an 
analytic continuation and a functional equation with respect tos «~ 2—. 
Normalizing the motives from Sects. 6.3 and 6.4 to have weight 0, one likewise 
expects good analytic properties of corresponding L(M, s), involving now functional 
equations s <> 1 —s. 

We do not know yet how to compute L-polynomials in the context of Sect. 6.4, 
where the motivic Galois group is generically the fourteen-dimensional algebraic 
group Gz. However the computation of L-polynomials is feasible in the setting of 
Sect. 6.3 where the motivic Galois group is just the finite group G2(2). In fact, as 
commented already in Sect. 5.5, we are using motivic language mainly because it is 
the natural general context for division polynomials. The particular motives from 
Sect. 6.3 correspond to finite-image Galois representations and so this language 
could be avoided. 
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In the setting of Sects. 4, 5, and 6.3, analytic computations with global L- 
functions (17) are possible on a numerical level. To illustrate this, we consider the 
motive M from Sect. 6.3 associated to the specialization point used in Sects. 4.3 and 
5.5, namely (u,v) = (—4, —3). This motive corresponds to the seven-dimensional 
irreducible representation of G2(2) into SO(7). It is natural here to twist by the 
Dirichlet character y given on odd primes p by y(p) = (—1)~/. The twisted 
motive M’ corresponding to the other seven-dimensional irreducible representation 
of G2(2). At the level of good L-polynomials, passing back and forth between M 
and M’ means replacing x by y(p)x. 

Let p = 5 be a prime. The corresponding Frobenius class Fr, can usually be 
deduced from Table 1 from the mod p factorization partition of So(—4, —3, z) and 
the class of p modulo 4. To make the necessary distinction between 3A and 3B, 
we use the factorization partition of the resolvent f36(4, x) presented in (19). The 
(x, ¢)-coordinates of Fr, on Fig. 3 then yield the L-polynomial 


L,(M,x) = 1—ax+ bx? — 0x + cx? — bx? + ax® — x’. 


Herea = y,b= y+, andc=a+a’—b. 

The necessary 2-adic and 3-adic analysis for obtaining conductors and bad L- 
polynomials is begun in Proposition 8.2 below. For L(M, s) the conductor is 27°3!?, 
the decomposition of the exponents as a sum of seven slopes being as follows. 


11 
At2: 20=6:-342. At 3: ea ta 


Since all slopes are positive, the bad L-polynomials are L2)(M,x) = L3(M,x) = 1. 
For L(M’, s), the slopes are all the same except the 2-adic slope 2 is now 0, so that 
the conductor drops to 2!83!?. Slopes of 0 contribute to the degree of L-polynomials, 
and in this case L7(M’, x) = 1 — x while still L3(M’, x) = 1. 

In principle, Magma’s Artin representation and L-function packages [2], both 
due to Tim Dokchister, should do all the above automatically, given simply 
So(—4, —3, z) as input. However the inertia groups at 2 and 3 are currently too large, 
and so Magma can only be used with the above extra information at the bad primes. 
It then outputs numerical values for arbitrary s, on the assumption that standard 
conjectures hold. Particularly interesting s include those of the form 5 + it with t 
real, i.e. those on the critical line. Here one multiplies L by a phase factor depending 
analytically on ¢t to obtain a new function L* taking real values only. Figure 4 
presents plots for our two cases, numerically identifying zeros on the critical line. 

To obtain analogous plots of L*(M, wil + it) for a general weight w motive, 
such as the weight one motives from Sects. 4.3 and 5.5, division polynomials do not 
at all suffice. Here one needs the much more complete information obtained from 
point counts, like the L,(M, x) presented in Sects. 4.3 and 5.5 for p = 5 and p = 7. 
However division polynomials can still be of assistance in obtaining the needed 
information at the bad primes. 
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Fig. 4. Graphs of L*(M, 5 + it) (left) and L*(M’, 5 + it) (right) 


7 Specialization to Three-Point Covers 


In Sect.7.1 we find projective lines P in U3,, and U3, suitably intersecting the 
discriminant locus in only three points. In Sect. 7.2 we consider the covers obtained 
by the preimages under zr, and zz of these lines. We thereby construct some of the 
three-point covers Xp —> P mentioned in Sect. 3.1. As stated previously, it would be 
hard to construct these covers directly because these Xp always have positive genus. 
In Sect.7.3 we apply quadratic descent twice to a cover Xp — P coming from a 
curve P C U3). and recover the Malle-Matzat cover (3). 


7.1 Curves in U334 and U32 


The top half of Fig. 5 is a window on the real points of the naive completion U3; = 
PB x re. The discriminant locus Z3 ;,; consists of the two coordinate axes, the two 
lines at infinity, and the solution curve D, of 


pg —6pq+4p+4q-3=0. 


The five lightly drawn straight lines intersect Z3) ; in just three points, not counting 
multiplicities. The ten other lightly drawn curves have the same three-point property, 
although it is not visually evident. The points drawn in Fig.5 will be discussed in 
the next section. 

The bottom half of Fig.5 names and parametrizes the 15 lightly drawn curves 
in the top half. Each name is a superscripted letter. The five bulleted curves are the 
straight lines. There are other natural coordinate systems on the p-q-plane, and each 
of the other curves appears as a line in at least one of these coordinate systems. We 
are emphasizing the coordinates p and q because they make the natural involution 
of U3,;,, completely evident as p <> qg. The three curves labeled T* are stable under 
this involution. The remaining 12 curves form six interchanged pairs: T’ <> T”. Six 
of the 15 curves are images of lines in the cubic cover U. These source lines in U 
are indicated by a, b, c, d, e, andf. 
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3 2 1 0 1 2 3 
Pp q P q 

G' e t 1 Ge 1 t 
H' e 8t/9 3/4 H" e 3/4 8t/9 
I’ (9t — 5)/4 1/p y" 1/¢ (9t — 5)/4 
J (t+ 3)/4 (4p — 3)/p? J” (4q—3)/q? (t+.3)/4 
K' al (t+2)/3 (3p — 2)/p? K" ce] (3q—2)/q?_ (t+.2)/3 
L' e}3(t+4)/16 9(4p — 3)/8p? L" dj 9(4q— 3)/8q? 3(t + 4)/16 
F* ef| 3/(4t—1) 3/(4t — 1) 
M* t 1/t 
B* b —3t —3/t 


Fig. 5 Top: the p-g plane U3, ;(R). Discriminant loci (thick), bases of three-point covers (thin), 
and specialization points are drawn in. Bottom: parametrizations of the bases for three-point covers 


Figure 6 is the analog of Fig.5 for U32 = Pe and we will describe it more 
briefly, focusing on differences. The discriminant locus Z3 has four components, 
the two coordinate axes, the line at infinity, and the curve D> with equation 


@ —2ab? + 12ab + 6a + b* — 12b? + 30b* — 286+ 9 = 0. 


The light curves each intersect the discriminant locus in three points, where this 
time a contact point with D2 does not count if the local intersection number is 
even. Despite the relaxing of the three-point condition, we have found only 12 such 
curves. The five curves A, B, C, E, and F are images of generically bijective maps 
from curves a, b, c, e, and f in U. Curve d in U double covers B, and so does not 
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5 1. L 1 1 1 1 1 1 1 1 1 
30 —20 —10 0 10 

a b | a b 
A 27(b — 1)°/(5b—9) 9(t—1)/(5t—9) G (6-1)? t 
Be —48(t — 1) 9 H b(3b — 2)/3 2t/3 
C | -(b-—9)(5b—9)/27 9(t — 1)/(5t — 1) I} (b—1)?(5b+4)/b 4/(9t —5) 
E (b—1)8/(b—3) 3(#-1)/(t-3) J | —(b—1)?b/3(b—4) 4t/(t+3) 
F 3/(9t2 —7t +1)” qt K) —(6—1)°/(b— 3) 3¢/(¢ +2) 
Me —16t ot L =b7(10b-9) 9t/2(t + 4) 


27(2b—9) 


Fig. 6 Top: the a-b plane U3,>(R). Discriminant loci (thick), bases of three-point covers (thin), and 
specialization points are drawn in. Bottom: parametrizations of the bases for three-point covers 


have its own entry on Fig.6. For T = G, H, I, J, K, and L, the curve T C U3 
comes from 7’ and T” in U3,,,; via (5). Finally M C U3, is double-covered by M* 


in U314. 


7.2 Three-Point Covers with Galois Group I.2 


The previous section concerned the base varieties U3; and U3 only. For quite 
general covers X — U,, one gets three-point covers Xp — P by specialization to 
the P Cc X, listed there. We now apply this theory to our particular covers 7 : X; > 
U31,1 and m2 : X2 — U3. Because of the explicit parametrizations in Figs. 5 and 6, 
our bases are now coordinatized projective lines P! = P!. 
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Table 2 Sixteen three-point covers obtained from zr; and 7 by specialization 


Xo| X31 | X32 | Co | Ci |Coo} g28 | 836) WM |e 
H” 4A | 4B | 3B |-—|——| 0.3 |0 
qf 4A |12A| 2A |——|——| 0.3 |0 

b| B* B |\6A|2A) 8A} 1 Oo; 1 ji 

M |\12A} 2A | 8B | 2 2} 1 fl 
G | 4A | 6A | 3B} 2 2} 1 =f1 
H’,G” 12A| 4A | 3B | 2 5} 1 ]1 

e L’ |E,K\4C | 4A | 8A | 3 3} 1 |1 
GC H | 3A |12A| 3B) 3 5} 1 ]1 

a\ K’ A |4A| 8A} 8B) 4 7} 1 fi 

c| K” |C,I|3A|8A|6A| 4 6; 1 jl 

d| L” 6A | 4A | 6A) 4 5} 1 |1 

f | F*,I'| F | 4A) 8B/12B| 5 8} 1 jl 
J’ 4A }12A) 8B | 5 8} 1 jl 

L \12A| 3A | 8A} 5 8} 1 jl 
M* J | 6A /12A| 8B} 7 10} 5 |5 
J” |__[t2a}izal 6a | 8 | 11/4.083/3 


Table 2 gives the results. The first two lines illustrate the general phenomenon 
where Galois groups sometimes become smaller under specialization. Here the 
covers have Galois groups of order 216 and 432 respectively, thus of index 56 and 
28 in I.2. The covers Xz — P! each split into a genus one cover X77 > P! and the 
trivial cover P! > P!, 

The next fourteen lines each give a cover X32, —> P! with Galois group all 
of 1.2. They are sorted by the genus gg of this cover. In most cases, more than 
one base curve P! yield isomorphic covers, after suitable permutations of the three 
cusps {0, 1, co}. The local monodromy classes in J” always correspond to the first- 
listed parametrized base. These classes are unambiguously determined, except for a 
simultaneous interchange 4A < 4B, 8A < 8B, 12A <— 12B, coming from the outer 
automorphism of J”. We always normalize by making the first-listed interchanged 
class have an A in its name. 

Thus for example, specializing S;(p,q,x) at (p,q) = (—3t, —3/t) from the B* 
line of Fig. 5, one gets a polynomial in Z[t, x] with 554 terms. The local monodromy 
partitions are (6A,2A,8A) as printed. Alternatively, specializing S2(a,b,x) at 
(a, b) = (—48(t—1), 9) from the B line of Fig. 6, one gets a polynomial in Z[t, x] now 
with 252 terms. The monodromy partitions are the same, except for the reordering 
(Co, C1, Coo) = (2A, 6A, 8A). 
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Having specialized from two parameters down to one, it is now much more 
reasonable to print polynomials giving equations fog(t,x) = 0 and fa6(t,x) = 0 
corresponding to the covers in any of the last fourteen lines of Table 2. We do this 
only in the case where genera are the smallest, namely the third line: 


Sas(t, x) = 
—t (3x* — 252x3 + 222x? — 692x — 5)- 


(81x!? + 2106x!! + 26001x!° + 73332x? + 268515x* + 574938x’ 
+618759x° + 400896x° + 184140x4 + 52752x3 + 8952x? + 576x — 32)” 
+2!(1 — (4x + 1) (9x4 + 18x39 + 48x? + 18x + 1)° 


+3°(1 — Ot — 2)8x? (x2 + 8) (2 —2x-1)*, (18) 


fa6(t, x) = 
(4x4 — 3)° (4x — 12x? + 12x —3)° 
3G = 1)" Gi = 1) (@e =o)", (19) 


Here the genera, namely (g2, g35) = (1,0), are the reverse of those of the Malle- 
Matzat covers. 


7.3 Recovering the Malle-Matzat Cover 


The Malle-Matzat cover can be constructed from the last line of Table 2 via two 
quadratic descents as follows. The given cover X; > pl has ramification invariants 
(12A, 12A, 6A). Quotienting out by the involution t < 1 — tf on the base and its 
unique lift to X;, one gets the descended cover X. > P!, with s = 4¢(1 — 1). 
The ramification invariants of this cover are (12A, 2A, 12B). Quotienting now by 
s <> 1/s on the base and its unique lift to X2, one gets the twice descended cover 
x3 > pl, with u = —(s — 1)?/4s. The ramification invariants of this cover are 


(4b, 2b, 12AB), showing that it is the Malle-Matzat cover. 
In other words, 


(2r—1)4 lor t4+3 
m\ ———,x and S| Jes Ga = 
16(t— 1)t (t+3)2 4 
are two different polynomials defining the same degree 28 extension of Q(t). The 


left one is a quartic base-change of the Malle-Matzat polynomial m(u, x) while the 
right is a specialization of S(p, q, Z). 
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8 Specialization to Number Fields 


In this final section, we discuss specialization to number fields with discriminant of 
the form 2/3*. Section 8.1 discusses fields obtained by specializing the 1;. Section 
8.2 continues this discussion, involving also similar fields from other sources. 
Section 8.3 discusses analysis of ramification in general, with a field having Galois 
group PGL2(7) serving as an example. Section 8.4 concludes by analyzing the 
ramification of a particularly interesting field with Galois group $U3(3).2 ~ G2(2). 


8.1 Specializing the Covers 1; 


In this section, we restrict attention to number fields with Galois group [7.2 and 
discriminant of the form 2/3. Consider first the cover Xy) > U. We have found 216 
ordered pairs (u,v) such that the corresponding number field Q[x]/Fo(u, v, x) has 
Galois group [2 and discriminant of the form 2/3*. Different specialization points 
can give isomorphic fields, and we found 147 number fields in this process. 

Next consider the covers 7, : X; — U31,, and m2 : X2 — U32. Beyond images 
of specialization points in U(Q), we found 248 pairs (p,q) and 177 pairs (a, b) 
giving fields with Galois group [2 and discriminant of the form 2/3. We obtained 
62 new fields arising from both covers, 95 new fields arising from zr, only, and 72 
new fields arising from z2 only. Thus we found in total 376 fields with Galois group 
I.2 and discriminant of the form 2/3*. 

Figure 7 indicates the pairs (j, k) arising from field discriminants 2/3* of one of 
these 376 fields. The area of the disk at (j, k) is proportional to the number of fields 
giving rise to (j,k). In 36 cases, this field is unique. The largest multiplicity is 19, 


70F e ° eo -e e 
! -@ ee - @6@ e °°e@0e e 
605 ° $3: ° 3388 3 
L ee e °-e@e e 
50 F : a 
E e ee0e 
40 F = 
[ fe ° 
30F ° 
206 
10F 
ot i i i L i i i \ \ \ \ 1 fl \ fl L \ \ . \ 
0 20 40 60 80 100 


Fig. 7 Pairs (j,k) arising from field discriminants 2/3‘ from specializations of F,(p,q,x) and 
F(a, b, x) 
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arising from (j,k) = (106, 66). The smallest discriminant is 2°°3*°, coming from 
just one field. This field arises from eight sources, 


1 1 32 49 
(u,v) = (—4, —3), ( > i) (5.3). (4, —3), (—32, 1), (- 81 a): 


1 
Ga (1, ;); (20) 


(a,b) = (- 7-3): 


The largest discriminant 2!°63”° arises from four fields. 

The phenomenon of several specialization points giving rise to a single field 
is quite common in our collection of covers z;. The octet in (20) is the most 
extreme instance, but there are many other multiplets as altogether 216 + 248 + 
177 = 641 different specialization points give rise to only 376 fields. This 
repetition phenomenon is discussed for a different cover in [13, Sect. 6], where it is 
explained by a Hecke operator. It would be of interest to give a similar automorphic 
explanation of the very large drop 641 — 376. Ideally, such a description would 
follow through on one of the main points of view of Deligne and Mostow [4, 5], by 
describing all our surfaces via uniformization by the unit ball in C”. 


8.2 Summary of Known Fields 


We specialized the Malle-Matzat cover in [13, Sect.8] to obtain fields with 
discriminant of the form 2/3*. From t = 1/2 we obtained a field with Galois group 
I’, while from 41 other ¢ we obtained 41 other fields with Galois group 1.2. While 
in our covers 7; the .2 always corresponds to the quadratic field Q(i), in the Malle- 
Matzat cover general Q( J9) arise. 

Sorting all the known fields by 0 € Q*/Q*?, including two additional fields from 
[14] with 0 = 2 and 0 = 6, one has the following result. 


Proposition 8.1. There are at least 409 degree 28 fields with Galois group I" or 
I.2. and discriminant of the form 2/3". Sorted by the associated quadratic algebra 
Q[x]/ (x? — 9), these lower bounds are 


aé);-6 -3 —2 -1 1 2 3 6 
5 6 6 381 1 7 2 «41. 


Two aspects of our incomplete numerics are striking. First, it is somewhat surprising 
that there are at least 408 number fields with Galois group /.2 and discriminant 
DZ", By way of contrast, the number of fields with Galois group $7, Sg, and S» and 
discriminant -+2/3* is exactly 10, at least 72, and at least 46 respectively [9]. Second, 
the imbalance with respect to 0 is quite extreme. We have not been exhaustive in 
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specializing our covers and we expect that the 381 could be increased somewhat. 
By exhaustively specializing Shioda’s W(E7)* family, in principle one could obtain 
the correct values on the bottom row. Our expectation however is that most fields 
have already been found and so the imbalance favoring Q(i) is maintained in the 
complete numerics. 


8.3 Analysis of Ramification 


In general, let K be a degree n number field with discriminant d and root 
discriminant 6 = |d|'/". It is important to simultaneously consider the Galois 
closure K®", its discriminant D and its root discriminant A = |D|!/". For a given 
field K, one has 8 < A. To emphasize the fact that the large field K®*! is never 
directly seen in computations, we call A the Galois root discriminant or GRD of 
K. A GRD A is typically much harder to compute than the corresponding root 
discriminant 6, as it requires good knowledge of higher inertia groups at each 
ramifying prime. 

For sufficiently simple K, ramification is thoroughly analyzed by the website 
associated to [7], and the GRD A is automatically computed. The 409 fields K 
contributing to Proposition 8.1 are not in the simple range, and we will present one 
ad hoc computation of a GRD A in the next section. As an illustration of the general 
method, we first consider an easier case here. 

For the easier case, take t = —1 in (18), which corresponds to (p,q) = (3,3) 
via B* and (a,b) = (—48,9) via B, both of which come from (u,v) = (1,2). 
The discriminant and root discriminant of K = QJ[x]/fog(—1,x) are d = 2374 
and 6 = 25.007 respectively. This root discriminant is much smaller than the 
minimum (2%3*)!/28 ~ 31.147 appearing in Sect. 8.1. The field K was excluded 
from consideration in Sect. 8.1 because the Galois group is not /.2 but rather the 
336-element subgroup PGL»(7). This drop in Galois group is confirmed by the 
factorization of the resolvent into irreducibles: fgg(—1, x) = xfta(x)fo1 (x). 

The group PGL,(7) can be embedded in Sg, which means that K®*! can also 
be given as the splitting field of a degree eight polynomial. Such a degree eight 
polynomial was already found in [8, Table 8.2]: 


f(x) = 2° — 6x4 — 48x — 7227 — 48x — 9. 


The analysis of ramification is then done automatically by the website associated to 
[7], returning for each prime p a slope content symbol SC, of the form [s1,..., sg]''. 
This means that the decomposition group D, has order p‘tu, the inertia subgroup 
I, has order p*t, and the wild inertia subgroup P, has order p*. The wild slopes s; 
are then rational numbers greater than one measuring wildness of ramification, as 
explained in [7, Sect. 3.4]. 

In our PGL2(7) example, also taking weighted averages to get Galois mean slope 
[7, Sect. 3.7], the result is 
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SG = [2,3,7/2, 9/2]; Oia + = 
a i i 16 8 ae a ae} 


6 
sc; =[Jf, GMS; = = 


The Galois root discriminant is then A = 279/83°/7 ~ 31.637. This Galois root 
discriminant is the fifth smallest currently on [9] from a field with Galois group 
PGL,(7). 


8.4 A Lightly Ramified Number Field 


Let K be the number field coming from the eight specialization points (20). 
Applying Pari’s polredabs [17] to get a canonical polynomial, this field is 
K = Qhx]/f(x) with 


fQ@) = 
x78 — 4x77 4 18x7° — 60x? + 165x74 — 420x73 + 798x*? — 144027! 
+2040x7° — 2292x)9 + 2478x!8 — 756x!7 — 657x!6 + 1464x)5 — 4920x!4 
+3072x'3 — 1068x'? + 3768x!! + 1752x!° — 4680x? — 1116x* + 672x7 
+1800x° — 240x° — 216x* — 192x7 + 24x? + 32x +4 4. 


The field K arises from (18) with either t+ = 4 or t = 32/81, so we also have 
its resolvent K36 = Ql|x]/fs6(4, x) from (19). Since one of the eight specialization 
points in (20) is (u,v) = (—4, —3), we have also seen this field already in the three 
sections about L-polynomials, Sects. 4.3, 5.5, and 6.5. ¥ Let K®! be the splitting 
field of K. Calculation of slope contents is not automatically done by the website of 
[7] because degrees are too large. The proof of the following proposition illustrates 
the types of considerations which are built into [7] for smaller degrees. 


Proposition 8.2. The decomposition groups of K®* at the ramified primes have 
invariants as follows: 


43 
16’ 
i? 8 1 Oi. 1s 
78° 078 136) 2 


7 3 
SC> = [2,2,2, 3, 3]3, GMS, = —-2 3= 
2= I a ae 


SC3 = [13/8, 13/8, 11/6];, GMS; = 


Thus the root discriminant of K®"! is A = 243/1©3!?9/ ~ 43.386. 


Proof. The computation is easier at the prime p = 3 and so we do it first. The field 
K factors 3-adically as K27 x Q3 with K7 having discriminant 3°°. The exponent 
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arises from three slopes 5; < 52 < 53 via 46 = 2s; + 652 + 1853. One of the 
two degree 63 resolvents, computed by Magma, factors 3-adically as K54 x Ko, with 
Ky = Q3[x]/(@? + 6x° + 6) having slope content [13/8, 13/8]%. This forces the 
remaining slope of K27 to be s3 = 11/6. The inertia group D3 is thus the maximal 
subgroup 31,'* : 8 : 2 of I.2, with slope content [13/8, 13/8, 11/6]. 

Moving on to the prime 2, the field K factors 2-adically as Kj6 x Ki2. Here K6 
is totally ramified of discriminant 2**. The complement Kj, contains the unramified 
cubic extension of Q> and has discriminant 274. Since the group Si¢6 X (S42 C3) 
does not contain an element of cycle structure either 8°21 or 8°4, the decomposition 
group D» cannot contain an element of order eight. Thus D2 cannot contain a Sylow 
2-subgroup of I°.2. So even though ord2(|/.2|) = 6, there can be at most five wild 
slopes. 

The resolvent K35 factors as Ki6 x Ki. Kg, with Kg & Qo[x]/(x8+2x’+2) having 
discriminant 2'4 and slope content [2, 2, 2}}. Thus we have found three slopes to be 
2, 2, and 2. If we can find two more wild slopes we will have identified all wild 
slopes. 

The field Kg and the sextic field Ke = Qo[x]/(x° + x* + 1) with discriminant 2° 
have the same splitting field. The latter is a subfield of Kj. showing that (24—6)/6 = 
3 is a fourth 2-adic slope. In fact, since both involutions in .2 have cycle type 2!714 
and therefore must appear in the degree 12 factor, 3 is the largest wild slope. 

The quartic subfield of Kg is Ky = Qo[x]/(x* + 2x? + 2x? + 2) with discriminant 
2°. Computation shows it is a subfield of Ki¢. So the remaining slope s satisfies 
1-24+2-2+ 4-5 + 8-3 = 42 and must also be 3. The tame degree /)/P2 can only 
be 1, as the only other possibility t = 3 would force u = 2 and I.2 does not contain 
a solvable subgroup of order a multiple of 2°37 = 576. Thus D> has order 96 and 
slope content [2, 2, 2, 3, 3]}. Oo 

The Galois root discriminant A ~ 43.386 is very low, as is clear from 
the discussion in [8], as updated in [9, Table 9.1]. In fact, the field K®"! is a 
current record-holder, in the sense that all known Galois fields with smaller root 
discriminants involve only simple groups of size smaller than 6048 in their Galois 
groups. 
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A Variant of Weyl’s Inequality for Systems 
of Forms and Applications 


Damaris Schindler 


Abstract We give a variant of Weyl’s inequality for systems of forms together 
with applications. First we use this to give a different formulation of a theorem of 
B.J. Birch on forms in many variables. More precisely, we show that the dimension 
of the locus V* introduced in this work can be replaced by the maximal dimension 
of the singular loci of forms in the linear system of the given forms. In some cases 
this improves on the aforementioned theorem of Birch. 

Second, we improve on a theorem of W.M. Schmidt which states that the number 
of integer points inside a given box, that lie on the variety given by a system 
of homogeneous forms of the same degree, satisfies the asymptotic behaviour as 
predicted by the classical circle method, as soon as the so called h-invariant of the 
system is sufficiently large. In this direction we generalise previous improvements 
of R. Dietmann on systems of quadratic and cubic forms to systems of forms of 
general degree. 


1 Introduction 


We consider a system of homogeneous forms fj(x1,...,%») € Zl[x1,...,X,] of 
degree d. For convenience we write x = (x),...,X,) and ask for the number of 
integer solutions to the system of Diophantine equations given by 


fix) =0, l<i<r. 
More precisely, we fix a box 4 C R” which is contained in the unit box centred at 
the origin and we let P > 1 be some real parameter. Then we define the counting 


function 


N(P) = i{x € Z" : x € PY, f(x) =0, 1 <i< rt. 
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This counting function has received a lot of attention (see for example [2, 10]), 
and is a central object of investigation in number theory. If the number of variables 
n is relatively large compared to the number of equations and the degree d, then 
the Hardy-Littlewood circle method has proved to be a valuable tool in obtaining 
asymptotic formulas for the counting function N(P). 

A very general result in this direction has been obtained by Birch in [2]. He 
introduces a locus called V* which is the affine variety given by 


ank ( ofi(x) ) <r 
ax; J \sisr 
= 


In his work [2] Birch provides an asymptotic formula for N(P) as soon as 


n—dimV* > r(r + 1)(d—1)27 1. 


A main ingredient in most applications of the circle method, as for example that 
in [2], is a form of Weyl’s inequality. In this paper we present a variant of Weyl’s 
inequality for systems of forms, and give two applications of our new form of Weyl’s 
inequality. 

First this allows us to replace the dimension of the locus V* in Birch’s theorem 
on system of forms by a quantity which appears to be more natural in this context, 
and bounds the previously used quantity from below. For an integer vector b € Z’ 
we let fy = bifi+...+0,f,- be the form in the pencil of f,...,f associated to b. For 
any homogeneous form g we write Sing(g) for the singular locus (in affine space) 
of the form g = 0. We can now state a variant of Birch’s theorem on forms in many 
variables as follows. 


Theorem 1. Assume that 


n— max (dim Sing(f,)) > r(r+ 1)(d—1)271. 
beZ’\{0} 


Then we have the asymptotic formula 
N(P) = 6 ygP"" + O(P™**), (1) 


for some 6 > 0. Here S and Y are the singular series and singular integral. 


This is essentially the main theorem of Birch’s work [2] where the quantity 
dim V* is replace by maxpez\s0} (dim Sing(fy)). In other words we can now describe 
the singularity of the system of forms f;, 1 < i < r by the maximal dimension of the 
singular loci of forms in the pencil. To our knowledge, it is hard to find a satisfactory 
geometric interpretation for the locus V* (see for example work of Aleksandrov and 
Moroz [1] into this direction). 
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Furthermore, we point out that for any non-trivial form f, in the pencil, the 
dimension of the singular locus dim Sing(fy) is always bounded by dim V~*. Indeed, 
the singular locus of the form ff, is given by 


af, 
ib Wa...0b “MWe PeieA 
Ox; Ox; 


If some vector x is contained in Sing(f,), then these relations imply that the rank of 
the matrix GE) can be at most r. This shows that Sing(f,) C V*, and dim Sing(f,) < 
dim V* for any non-zero vector b. 

Hence Theorem | formally implies Birch’s theorem in [2]. Furthermore, there 
are examples of systems where Theorem | is stronger than Birch’s main theorem 
in [2]. For simplicity of notation let 


u=u(f):= max (dimSing(f,)). 
beZ’\{0} 


Let k > r — 1 be some integer and consider the system of quadratic forms 
k 
Oi(x, y) = > ae l<i<r, 
j=l 


in the k(r+ 1) variables x; for 1 <j < k and y; for] <i<rand1 <j <k. A short 
computation reveals that 


dim V* =k(r—1)+r—-l=u4+r-l. 


Note also that once we choose k sufficiently large, Theorem | is indeed applicable. 
On the other hand these examples are essentially sharp. If we work over the complex 
numbers then we have 


V* = Upecr\so} Sing (fy). 
and this leads to the bound dim V* < uc + r— 1, where 


uc := max (dimSing(fp)). 
bec'\{0} 


Since being published in 1962, Birch’s work [2] has received a lot of attention 
and has been generalised in multiple directions. It seems natural to expect that 
our observation and new formulation of the main result in Theorem | can in an 
analogous way be transferred to most of these generalisations and developments. 
Some examples to mention are work of Brandes [3] on forms representing forms and 
the vanishing of forms on linear subspaces. Furthermore, the analogue of the locus 
V* in work of Skinner [11], which generalises Birch’s theorem on forms in many 
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variables to the number field situation, and work of the author [7] on bihomogeneous 
forms, could very likely be replaced by a non-singularity condition on forms of the 
linear system. Another result and application in this direction is a paper of Lee [6] 
on a generalisation to function fields F, |]. 

As a second application of our new form of Weyl’s inequality for systems of 
forms, we can strengthen a theorem of Schmidt [10], which provides an asymptotic 
formula for the counting function N(P) as soon as a so-called h-invariant of the 
system is sufficiently large. As a special case of this we recover the results of 
Dietmann’s work [4] on systems of quadratic and cubic forms. 

For a homogeneous form f(x) € Q[x] we define the h-invariant of f to be the 
least integer ) such that f can be written in the form 


h 
£0) = Yo gilx)gi(x), 


i=1 


with forms g;(x) and g/(x) of positive degree with rational coefficients. For a system 
f of homogeneous forms f;(x), | <i < r, of degree d, we define the h-invariant h(f) 
to be the minimum of the /-invariant of any form in the rational linear system of the 
forms, i.e. we set h(f) = minpezr\so} A(fo)- 

We say that a system of forms f of degree d is a Hardy-Littlewood system if the 
conclusion on the asymptotic formula for the counting function N(P) as in Eq. (1) 
in Theorem | holds. If the h-invariant of a system of homogeneous forms of the 
same degree is sufficiently large, then Schmidt proves in his work [10] that f is a 
Hardy-Littlewood system. As we shall indicate in Sect. 2, his results easily imply 
the following theorem, which we state here for convenience. 


Theorem 2 (Schmidt, 1985, See [10]). There exists a function $(d) with the 
following property. If the system f of homogeneous forms of degree d > 1 has a 
h-invariant which is bounded below by 


hA(f) > $A + Da@- 12%! + @-Vr(r— 0), 
then the system f is a Hardy-Littlewood system. Furthermore, one has $(2) = 
(3) = 1, (4) = 3, 6(5) = 13 and $(d) < (log 2)~@d! in general. 


Note that the function ¢(d) is exactly the function occurring in Proposition [Ic 
in Schmidt’s work [10]. 

Our new form of Weyl’s inequality improves on this theorem in the following 
way. 


Theorem 3. Let }(d) be the function as in Theorem 2. If the system f of homoge- 
neous forms of degree d > | has a h-invariant which is bounded below by 


hf) > b(d)r(r + 1I)(d—1)2"1, 
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then f is a Hardy-Littlewood system. In the case d = 2 one may replace the condition 
on the h-invariant of the system by the assumption that the rank of each form in the 
rational linear system of the quadratic forms is bounded below by 2r(r + 1). 


The special cases of degree d = 2 and d = 3 in Theorem 3 reduce to Theorem 1 
and Theorem 2 in Dietmann’s paper [4]. In the quadratic case Dietmann improves on 
previous results of Schmidt in [8] in reducing the lower bound in the rank condition 
from 2r? + 3r to only 2r? + 2r, and in the cubic case he reduces the lower bound 
on the h-invariant from 107? + 6r (see Schmidt’s paper [9]) to 8r? + 8r. In fact, our 
new form of Weyl’s inequality takes up the main idea in Dietmann’s work [4]. 

As Dietmann points out in [4], the A-invariant can in some ways be seen 
as a generalisation of the rank of a quadratic form to higher degree forms. By 
diagonalising a quadratic form one sees that its h-invariant is bounded by its rank. 
However, we note that these two notions do not coincide for the case of quadratic 
forms, as examples built up from forms like a _ i = (x, + x2)(%) — x2) show. 
Hence we need to formulate the case d = 2 in Theorem 3 separately in order to 
obtain the full strength of the theorem in this case. 

As another example we consider the case of systems of forms f of degree d = 4. 
In this case one has #(4) = 3 and Theorem 3 implies that the expected asymptotic 
formula for N(P) holds as soon as 


h(f) > 3r(r +1) x 3.x 2? = 9x (87° + 87). 


Schmidt obtains the same result in his paper [10] (see Theorem 2 above) under the 
stronger condition 


A(f) > (4 (rr + Yd — 1924! + (d= 1)r(r— 1) = 909 +78). (2) 


We finally remark that if the system of forms f;(x), 1 < i < r, in Theorem | or 
Theorem 3 forms a complete intersection, and if there exist non-singular real and 
p-adic points on the variety X given by these forms, then the singular series S and 
the singular integral Y are both positive. In particular, this implies the existence of 
rational points on the variety X as soon as there are non-singular solutions at every 
place of Q including infinity. 

The structure of this paper is as follows. We recall a version of Weyl’s inequality 
from [2] in the next section and present in Lemma 2 our new variant of Weyl’s 
inequality for systems of forms. We use this in the last section to deduce Theorems | 
and 3, and we explain the improvements of Theorem 3 compared to Theorem 2. 

After Dietmann posted [4] on the ArXiv, containing the new variant of Weyl’s 
inequality and its application to systems of quadratic and cubic forms, he (in the 
revised published version [5]) and the author independently generalized his work 
to systems of forms of higher degree, and to Birchs theorem, obtaining the same 
conclusions Theorems | and 3. Most of our reference to his work hence refers to 
the earlier version [4], and now Lemma 2 could also be deduced from the later 
version [5]. 
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For some n-dimensional box &, some real vector @ = (a1,...,a@,) and some large 
real number P we define the exponential sum 


S@= > -(Sane). 


xePZnz" \i=1 
If f(x) is some homogeneous form of degree d, then we let Tx, ...,X) be 
its unique associated symmetric multilinear form satisfying I+(x,...,x) = d!f(x). 
Moreover, if f;(x), 1 < i < r, form a system of homogeneous forms of degree d 
as before, then we let Rx®, ts x), 1 <i < r, be the associated multilinear 
forms. We introduce the sup-norm |x| = maxj<;<, |x;| on the vector space R”, 
and write ||y|| = min,ez|y — y| for the least distance of a real number y to an 


integer. Furthermore, we write here and in the following e; for the j-th unit vector 
in n-dimensional affine space. Then we let N(P*; P~"; «) be the number of integer 
vectors x... , x with |x®|,...,|x@| < P® and 


7 
be a T;(@;, xO, x) 
i=] 


We start our considerations with recalling Lemma 2.4 from Birch’s work [2]. 


Lemma 1 (Lemma 2.4 in [2]). Let k > 0 be a real parameter. For fixed0 <0 <1 
one of the following alternatives hold. 


(i) |S(oe)| <P", or 
(ii) N(P°; P~4+ EYE: @) >> pend ke for anye> 0. 

The main idea is to treat the condition (ii) differently than in Birch’s work [2], 
following a similar idea as taken up in the paper [4]. Before stating our new version 


of Weyl’s inequality, we need to introduce the g-invariant of a homogeneous form. 
We define .% to be the variety in affine (d — 1)n-space given by 


Tex 020,39) =0, Lessa, 
and write .%(P) for the number of integer points on .4% with coordinates all 
bounded by P. Then we define the g-invariant g(f) of the form f to be the largest 
real number such that 


M,(P) <K pa-Da-ste ; 


holds for all ¢ > 0. Note that this number g(f) coincides with the g-invariant that 
Schmidt associates to a single form f in [10]. 
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Lemma 2. Let g = infpezr\s0} g(fp) and let0 < 0 < 1 be fixed. Then we either 
have the bound 


(i) |S(oe)| < Pr?“ "'86+8, oy 

(ii) (major arc approximation for o with respect to the parameter 0) there exist 
natural numbers a,,...,a,; and1<q<« Pe-)? With gcd(qg,a),...,a,) = 1 
and 


|qa; —ai| << Pewee Tie 


Proof. Let the notation be as in Lemma | and assume that alternative (ii) in 
Lemma | holds. 

We consider the matrix y of size r x (nN(P°; P~¢*+—-)®: @)) with entries 
T,(e;,x™,...,x) in the ith row. The columns are indexed by (j,x®,..., x), 
where 1 <j < nandx®),...,x run through all tuples of integer vectors counted 
by N(P°; P-4+(4—-)9: @), We distinguish two cases. 

Case (a): Assume that rank(y) = r. Then there is a r x r-minor Vv of full rank, 
which we say is given by 


2 d 
v= = (Widi<itcr = = (17 (ej,. X; x! 2 tee .x! ieee. 


In particular, we have || )\)_, aWiul| < P~¢*-", for all 1 < 1 < r. Hence, we 
can write 


~ avis =a+ bp, 
i=1 


with integers a) and real numbers 8) with [dik = perie“N! Let yes be the adjoint 
matrix to w, which satisfies yy = = (det v) id, and let g = det yw. Since W was 
assumed to be of rank r, its determinant q is non-zero. Furthermore we note that 
lq| < P’4-)), Now we can use the adjoint matrix yrs to find a good approximation 
for a by rational numbers with small denominator. Indeed, we have 


r 
adj ~ 7 adj)) 9 
got; — Dy Vi't| < Do Wir 116 
I=1 
K pir-VG-I) p-d+(d-)8 
We set aj = oi) Vy ay. After removing common factors of g and the integers aj, 


we obtain integers g,a),...,a, with gcd(q, a1,...,a,) = landl <q « P™, 
such that 


|gari — ail <K prdtra-1e | 1 < i <r. 


In this case the conclusion (11) of the Lemma holds. 
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Case (b): Assume that rank(y) < r. Then the r rows of w are linearly dependent 
over Q, and thus there exist integers b,,...,b, € Z, not all zero, such that 


; 
De Cee arreree areas 
i=1 


for all 1 <j < nand forall tuplesx®, ...,x counted by N(P?; P~7+—))®; a). We 
note that 


: 
> b TK, ...,x) = Tex, ...,x) 
i=1 


is the multilinear form associated to the form f,(x) = )>;_, Difi(x). We recall the 
definition of the variety .% stated before this lemma, and deduce from the lower 
bound on N(P?; P~2+(4-)®: &), that we have 


Mp, (P°) > Pd Ind— 2 he | (3) 
for any ¢ > 0. By definition of the g-invariant g(fp) we see that Eq. (3) implies that 
(d—1)n—24'k/0 —e < (d—1)n—glf) +, 
for any ¢ > 0. Hence we have 2~“*! 9(f,)0 < k for some b € Z’ \ {0}, and the first 


alternative of the lemma holds. 


We remark that Lemma 2 has recently also appeared in the form of Lemma 2 
in [5]. 


3 Applications 


The main goal of this section is to prove Theorems | and 3. Before we show how 
Lemma 2 implies Theorems | and 3 we recall some very general results from 
Schmidt’s work [10], which simplify our following arguments. 

For this we first recall the Hypothesis on f introduced in Sect. 4 in [10] for the 
case of forms of same degree. 


Hypothesis 4 (Hypothesis on f with Parameter (2) Let Z be some box and 
A > 0 and assume that P is sufficiently large depending on the system f, the 
parameter 2, the box B and A. Then one either has the upper bound 

i) |S(a)| < P*-4%, or 

ii) there are natural numbers q < P4 and a,...,a, such that 


|qa; — a;| < Pats 1 <iK<r. 
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In his work [10] Schmidt shows that this hypothesis is enough to verify that 
the Hardy-Littlewood circle method can be applied to the counting function N(P) 
related to the system of equations f. One of his main results is the following, which 
we only state for the special case of forms of the same degree, since this is all we 
use in this paper. 


Theorem 5 (Proposition I in [10], Second Part). Suppose the system f satisfies 
Hypothesis 4 with respect to some parameter 


2>rt+i. 


Then f is a Hardy-Littlewood system. 


In combination with our new version of Weyl’s inequality in Lemma 2 we obtain 
the following useful corollary. 


Corollary 1. Assume that f is a system of homogeneous forms of the same degree 
with 
inf > Iy(d—1)277. 
pei, Sb) > r+ Id 1) 


Then the asymptotic formula for N(P) as predicted by the circle method holds, i.e. f 
is a Hardy-Littlewood system. 


Proof. First we note that Lemma 2 implies Hypothesis 4 with respect to any 
parameter 


@ 2a di) 2. 


This is clear by the formulation of Lemma 2 for the range 0 < A < r(d— 1) if we 
set A = r(d — 1)@. In the case where A > r(d — 1) alternative (ii) in Hypothesis 4 
is automatically satisfied by Dirichlet’s approximation principle. 

Now Proposition I in [10] applies as stated in Theorem 5, which completes the 
proof of the Corollary. 


Next we relate the g-invariant of a homogeneous form to the dimension of its 
singular locus, and thereby establish the new formulation of Birch’s theorem on 
forms in many variables as stated in Theorem |. 


Proof (Proof of Theorem 1). Consider some vector b € Z” \ {0} and its associated 
form fp in the pencil of f;(x), 1 < i < r. We note that the intersection of the affine 
variety .%;, with the diagonal J given by 

x2 =. =x, 


is isomorphic to the singular locus of the form jf. Hence we obtain by the affine 
intersection theorem that 


dim Sing(f,) = dim(A N -G,) = dim J + dim. G, — (d — 1)n. 
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This shows that 
dim Sing(f,) = dim.4@, — (d— 1)n+n, 
which implies that 
g(fp) = n— dim Sing(fp). 


Taking the infimum over all non-zero integer tuples b we obtain 


inf >n— max (dim Sin ; 
yell SU) =m, max, (dim Sing(fs)) 


and hence Lemma 2 holds with g replaced by n — maxpezr\,o} (dim Sing(f,)). This 
shows that in Lemma 4.3 in Birch’s work [2], the quantity K which is defined in his 
setting as 


24K =n—dimV*, 


can be replaced by 


241K =n— max (dim Sing(f,)). 
beZ’\ {0} 


Now Theorem | follows identically as the main theorem in Birch’s paper [2]. 
Alternatively we can apply Corollary | to obtain the desired result. 


Next we turn towards the proof of Theorem 3 which improves on the previous 
known results in Theorem 2. However, since Theorem 2 is not contained in this 
formulation in the paper [10], we first give a short deduction of it from the 
results of [10]. Indeed, Schmidt concludes in his remark after Proposition [Jp that 
the expected asymptotic formula in Theorem 2 for N(P) holds as soon as a so called 
g-invariant g(f) of the system f is larger than 


g(f) > 2°" (d— Dr(r + 1). (4) 
His Corollary after Proposition III states that there is the relation 
hf) < o(a)(g) + (d— rr — 1). (5) 
Hence the condition 
h(f) > P(A (rr + Id = 128% + = Ir(r= 1), 


in Theorem 2 implies that (4) holds and thus the conclusion of Theorem 2 follows. 
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The main difference in the use of our new version of Weyl’s inequality in 
comparison to Schmidt’s work is that we can state everything in terms of the 
g-invariant of a single form. In his work [10] Schmidt uses a form of Weyl’s 
inequality where the infimum of all g-invariants of the elements of the rational linear 
system is replaced by his so called g-invariant g(f) of the whole system. This is in 
complete analogy with the replacement of the locus V* in Birch’s work [2] by the 
maximal dimension of the singular loci of elements in the rational linear system as 
in Theorem 1. 


Proof (Proof of Theorem 3). For a single form f, Eq. (17.2) in [10] implies that 


hf) < (d)s(f), (6) 


which should be compared to the relation (5) for systems of forms. For a single 
form we do not need the term ¢(d)(d — 1)r(r — 1), which is present for the relation 
referring to the whole system of forms in Eq. (5). This explains our improvement of 
Theorem 3 compared to Theorem 2. 

Assume now that the assumptions of Theorem 3 are satisfied, i.e. 


A(f) > o(d)r(r + I(d—1)2"1. 


Recall that we have defined h(f) = minpezr\ 0; h(fp). Hence we obtain together with 
Eq. (6) the relation 


inf 1I)(d—1)2. 
pet SU) > r(r+ I)(d— 1) 


Now we apply Corollary | to complete the proof of Theorem 3 for the case of degree 
d > 3. 

For the case of systems of quadratic forms we note that the g-invariant of a single 
quadratic form f is bounded below by its rank. Indeed, let some quadratic form f be 
given by some n Xx n-matrix A. Then the variety .% is given by the system of linear 
equations Ax = 0, and we deduce that 


M;(P) < pretank(A) 
which shows that g(f) > rank(A). Now we apply Corollary 1 as in the case d > 3. 
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The Breuil-Schneider Conjecture: A Survey 


Claus M. Sorensen 


Abstract This note is a survey of the Breuil-Schneider conjecture, based on 
the authors 30 min talk at the 13th conference of the Canadian Number Theory 
Association (CNTA XIII) held at Carleton University, June 16-20, 2014. We give 
an overview of the problem, and describe certain recent developments by the author 
and others. 


1 Prelude: The Fontaine-Mazur Conjecture 


Let F be a number field,! with absolute Galois group Ij = Gal(Q/ F ). The p-adic 
representations of Ij, which arise naturally, come from algebraic geometry. More 
precisely, if X/% 1s a (smooth projective) variety, one would look at irreducible 


constituents r : 1; — GL,(Q,) of the cohomology H2,(X xz Q, Q,(0). It is now 
known that such r are “geometric”, which by definition means the following: 


° Tw:=7|rs, is unramified? at all but finitely many places w } p. 
* 1, is potentially semistable at all places w\|p. 


The Fontaine-Mazur conjecture (Conjecture | in [23]) is the converse: These local 
conditions guarantee that r occurs in the cohomology of some X, up to Tate twist. 

In recent years, Emerton and Kisin have made impressive progress on this for 
odd two-dimensional representations of [—as a result of the proof of Serre’s 
conjecture [32], by Khare and Wintenberger! To give the flavor, Emerton shows 
the following result (see Theorem 1.2.4 in [21]). 


'This is the notation we will eventually use in Sect. 5 below, to distinguish it from F/Q,. 

*Here we tacitly fix algebraic closures F <> F,, extending F <> F,,, in order to identify the 
decomposition group Ip, = Gal(F,,/ Fy) with a subgroup of I. We say 1, is unramified if its 
restriction to the inertia group Jr, = Gal(F,,/F"”) is trivial. 
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Theorem 1.1 (Emerton). Let r : Ig > GL2(Q,) be an irreducible odd repre- 
sentation, p > 2. Assume | rog,) is irreducible, and that T|Po, is “generic”. If r is 
geometric, with distinct Hodge-Tate weights, then (up to a Tate twist) r = ry for 
some cuspidal eigenform f of weight k = 2. 


Remark I. We write r for the reduction of r modulo p. Let us add a few details. 
By the compactness of Ig, the image of r lies in GL2(@¢) for some finite 
extension E/Q,. Composition with Gg — ke = Of/@r@z yields the “naive” 
reduction mod p into GLy(kg) C GL2 (F,). The Brauer-Nesbitt principle tells us its 
semi-simplification is independent of the choice of stable @g-lattice in E*. This 
semisimple representation is what we denote by r. Of course, in Theorem 1.1 r 
is irreducible, so it’s given by any of its naive reductions. The notion of 7| Te, being 


gigas : ‘ = 1 x 1 x 
“generic” is a bit ad hoc. Here we take it to mean that r| Fagg ( i) or ( - i; 
: Ecye 


even after twisting by a character, and * may or may not be zero. (Here &,,, is the 
mod p cyclotomic character.) 


Since there is a motive attached to f, by Scholl and others [36], one knows how 
to carve out r = ry in the cohomology of some (Kuga-Sato) variety X/g. 

Very broadly speaking, the proof of 1.1 has two steps. The first step is essentially 
Serre’s conjecture, which combined with “big” R = T theorems of Béckle [8] tells 
us that at least r comes from a p-adic modular form; we say r is pro-modular 
(closely related to the modularity of r). The second step is to establish classical 
modularity of r, using that r|r, is potentially semistable of regular weight. This 
is where the p-adic local Langlands correspondence IT(-) for GL2(Q,) comes in. It 
goes from two-dimensional p-adic representation of Ig, to unitary p-adic Banach 
space representations of GL2(Q,,), and its “inverse” has a nice clean description via 
Colmez’s so-called Montreal functor. We’re applying /T(-) to the restriction r| Pep: 
Emerton shows in [21] that it satisfies local-global compatibility, which basically 
says that 


T(r|ro,) > Homr,(r, Hi), 


where H! is a huge Banach-space of p-adic modular forms (the completed coho- 
mology of the tower of modular curves). The condition at p ensures that IT(r| To,) 
has so-called “locally algebraic” vectors—much more on these in the next section! 
Hence so does Homy,(r, H '). But the locally algebraic vectors in H! have a 
description in terms of classical modular forms, and this essentially finishes the 
proof. 

This is meant to motivate the search for a p-adic local Langlands correspondence 
for GL,(F), for finite extensions F/Q,, and other groups. The Breuil-Schneider 
conjecture is one of the initial steps towards a precise formulation of what one is 
looking for. 
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2 Motivation: p-Adic Local Langlands for GL2(Q,) 


The classical local Langlands correspondence for GL2(Q,) is a bijection between 
certain two-dimensional Weil-Deligne representations of Wo, and irreducible 
smooth representations of GL2(Q,). Here the coefficient field FE is unspecified. One 
can take the complex numbers C, the ¢-adic numbers Qy for £ 4 p, or even Q— 
which we eventually will. The topology of E plays no role in the correspondence. 
A natural source of Weil-Deligne representations are Galois representations. This 
is where the topology of E plays a huge role. Thus, for £ 4 p, with any continuous 
representation p : Ip, = Gal(Q, /@) > GL2(Qy) one can associate a Weil- 
Deligne representation WD(p), and hence an irreducible smooth representation 
Ism(p) of GL2(Q,) over Qe. Moreover, one can recover p from 1sy,(p). This 
completely fails for £ = p, and this is what leads to the p-adic local Langlands 
correspondence. The classical local Langlands correspondence for GL(2) has been 
dealt with in detail in [7]. For GL(7) there are several good expositions. We mention 
(31, 41]. 

In what follows we will fix our coefficient field E, which we take to be a finite 
extension E/Q,. Suitably normalized, classical local Langlands is Mg, -equivariant 
and can therefore be defined over E (as opposed to Q,). If we restrict ourselves to 
potentially semistable representations p : Ig, — GLa(E), a recipe of Fontaine 
associates a Weil-Deligne representation WD(p), and hence z;,,(p) still makes 
sense. See [10, 29] for details on how this goes. However, p “> 7sm(p) is no longer 
reversible. In addition, we have the Hodge-Tate weights, which we will assume 
are distinct, HT(o) = {w1 < w2}. These correspond to an irreducible algebraic 
representation staig(p) of GL2(Q,)—the one of highest weight t +> r'?~'2Y' (relative 
to the upper-triangular Borel). More concretely, 


Talg(p) = det”! Qe Sym"2-"!—! (BE), 


Still, one cannot reconstruct p from 7sm(p) and zaig(p). In a nutshell, the problem 
is the following: In p-adic Hodge theory, the potentially semistable p are classified 
by linear algebra data which includes a certain Hodge filtration—and this is lost in 
the process of constructing Zqig(). We only see its jumps. 

The p-adic local Langlands correspondence takes any continuous representation 
p : Ig, —> GL»(E) and attaches a Banach E-space /7(p) with a unitary 
GL2(Q,)-action. Consult [16] for the most recent developments (for small p). This 
map p »> IT(p) is reversible, and compatible with classical local Langlands in 
the following sense: When p is potentially semistable, with distinct Hodge-Tate 


weights, 
IT(p)“"8 = Talg(P) @E Tsm(P) |. (1) 


Furthermore, /7(o)“® = 0 otherwise. Here the superscript alg indicates taking the 
locally algebraic vectors—an invariant subspace between the smooth vectors and 
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the locally analytic vectors. Recall that a vector is smooth if some compact open 
subgroup acts trivially. Analogously, a vector is locally algebraic if some compact 
open subgroup acts polynomially. (A good go-to source for these notions, as well as 
local analyticity, is Breuil’s ICM article [11].) 

When p is irreducible, [7(p) is known to be topologically irreducible, and 
therefore the completion of Zaig(P) @z Asm(p) relative to a suitable GL2(Q,)- 
invariant norm || - ||, which somehow “corresponds” to the lost filtration. 

For GL,(Q,), or even GL2(F) for finite extensions F/Q,, the p-adic local 
Langlands correspondence remains elusive. The Breuil-Schneider conjecture is 
in some sense a “first approximation”, which uses (1) as a guiding principle: 
The right-hand side can be defined for any potentially semistable representation 
p : Ir — GL,(E), with distinct Hodge-Tate weights, and the conjecture is 
that Zaig(P) ®e Hsm(P) admits at least one GL, (F)-invariant norm. The resulting 
completions should be closely related to the yet undefined /7(o)—at least in the 
irreducible case. 

Of course, ultimately one would want more than the mere existence of an 
invariant norm || - || on zaig(P) @z Asm(e). In general it will not be unique (up to 
equivalence), but should correspond to the possible compatible Hodge filtrations. In 
the case of GL2(Q,) this is exemplified by the connection to -invariants. 


Example 1. Let &/g, be a semistable elliptic curve. That is, its reduction mod p 
has a nodal singularity (as opposed to a cusp). This can always be achieved by 
passing to a finite extension. By the theory of the Tate curve, any such & has p-adic 
uniformization; meaning there’s a unique g € Qx with |g| < 1 such that & (Q,) a 


Q* /q@, respecting the I Q,7action. The Galois action pg, on the Tate module 


V,(E) := Q, ®z, To(€) = Q®,  7,(€) := lim é'[p"] ~ i Aaa 


n 


can then be made explicit in terms of the parameter g—or rather the “-invariant, 
L = Le = log(q)/ord(q). 


(Here log is the Iwasawa-log, with log(p) = 0.) This @ is encoded in pg p via p-adic 
Hodge theory; it can be read off from the Hodge-filtration (see [5] for the explicit 
relation). In this example, HT(p¢,)) = {0, 1}, so muig(pep) = 1 and Maig(pep) = 
St; the Steinberg representation (up to an unramified quadratic twist). Recall that, 


St = StcL,Q,) = 6° (P'(Q,))/{constants}. 


The sup-norm on St is certainly GL2(Q,)-invariant, and “corresponds” to “ = 0 
(that is, g = p). In [6], it is shown how any -@ defines an invariant norm || - ||.v on 
St, such that I7(p¢) = St (the completion). In fact, more generally, in loc. cit. 
Breuil defines a unitary Banach space representation B(k, @) of GL2(Q,), for any 
k>2and Ye Q,. The dual B(k, #)’ is a certain space of log ,-rigid functions on 
the p-adic upper half-plane 2 = C,\Q,. Furthermore, 


The Breuil-Schneider Conjecture: A Survey 223 


Bk, L)*8 = | det |'2 Sym‘ @ St. 


(Breuil shows this for k = 2 and for k > 2 when Y comes from a modular 
form; but it’s now known in general due to subsequent work of Colmez and 
others.) These Banach representations B(k, @) correspond to the semistable non- 
crystalline representations p : Ig, > GL» (Q,), with weights {0,k — 1}, under the 
p-adic local Langlands correspondence. It might be worth emphasizing that a very 
special feature for GL2(Q,), is that for (irreducible) crystalline representations p, 
the invariant norm on Zaig(P) @Ez Asm(p) is in fact unique up to equivalence—this is 
extremely important in local-global compatibility; which explains how IT(p) “sits” 
in completed cohomology when p arises from a p-adic modular form—see [21]. 


3 The Breuil-Schneider Conjecture: The Statement 


Besides the coefficient field E/Q,, we now fix a base field F/Q,. We assume 
throughout that F is large enough to contain the Galois closure of F. Now suppose 
p: ly — GL,(£) is a potentially semistable representation, with labeled Hodge- 
Tate weights HT,(e) = {wi < +--+: < Wn,r}, for each embedding t : F @ E. As 
in Sect. 2, one defines a smooth E-representation 7,,,(0) of GL,(F) via the classical 
local Langlands correspondence for GL, plus Fontaine. There is a small wrinkle 
here, which we are tempted to gloss over: There is an issue with genericity—not in 
the sense of the Introduction, but in the (usual) sense of 75,,() having a Whittaker 
model.? When 75(p) is non-generic, one should really replace it by a generic (but 
reducible) principal series, of which it is quotient. For instance, in the GL»-case, if 
p is the trivial representation one would take z,,,(o) to be the non-split extension 
of 1 by Steinberg. For the most part we will sweep this under the rug, and usually 
assume p is generic (i.e., that 7s,(e) is generic). 

For each r, we let zaig.r(p) be the irreducible algebraic representation of GL, (F) 
of highest weight (w,,. — (n — 1),...,w1,r) relative to the upper-triangular Borel. 
Then let zaig(P) = @reHom(F,£)Male,c(p), with GL,(F) acting diagonally. The 
Breuil-Scneider recipe, from [10], takes the tensor product, and gives a locally 
algebraic representation (again, with GL,(F) acting diagonally): 


Definition 3.1. BS(e) := Zaig(P) @z Hsm(p). 


Upon a closer look, this definition works in greater generality. It only depends 
on the data WD(p) and HT,(p) associated with p. In other words, we could start 
with data ZY = (WD, HT,), consisting of a Weil-Deligne representation WD, and 
tuples of integers HT; = {w1,, < --: < Wyx}, and define BS(Y) as above. The 
main conjecture of [10] is as follows (Conjecture 4.3 on p. 18 in loc. cit.). 


3Meaning the representation 7r,,,(¢) embeds into Indpi" (v) for some additive character w # 1. 
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Conjecture 3.2 (Breuil, Schneider). The data Y arises from a potentially semistable 
representation if and only if BS(Z) admits a GL, (F)-invariant norm. 


The “if” part of this conjecture is completely known [29], and is due to 
Y. Hu—who proved a stronger result: First, WD arises from an étale (¢, N)-module 
Dina standard way (see 2.2 in [29]). As part of his Orsay Ph.D. thesis, Hu translated 
the existence of an admissible filtration on D, with jumps HT,, into a purely group- 
theoretic condition known as the “Emerton condition” (in the vein of Theorem | in 
[25], which deals with the crystalline case). We rephrase part of Theorem 1.2 (or the 
slightly more general Theorem 2.12, p. 12) in [29] in our notation: 


Theorem 3.3 (Hu). Y comes from a potentially semistable representation if and 
only if the Emerton condition holds: For every parabolic P, with Haar modulus 5p, 


\(6n'D@Ip <1, Vee Zt, Vx Jp(BS(F)). (2) 


Here Zi is the contracting monoid in the center of the Levi M = Mp, and Jp(-) is 
Emerton’s locally analytic Jacquet functor (introduced and studied in [20]]). 


Let us add a few words on the contracting monoid (see p. 33 in [20] for more details): 
Once and for all, fix a compact open subgroup Ny C N = Np, and let Mt = {me 
M|mNom"! C No}. Then Zi = M* f Zy. In the case of GLy, taking P to be the 
standard parabolic corresponding to the partition n = n, +----+n,, it’s given by 


Zz, = {diag(t; In, oe sty Tn,) : |t1| < eee eag s |t,|}. 


It may also be worth pointing out that since BS(A) = Maig(F) @ez Wsm(F) is 
locally algebraic, we strictly speaking don’t need the locally analytic extension of 
the Jacquet functor here: By Proposition 4.3.6, p. 63, in [20], 


Ip(BS(Y)) ~ Taigl BY” ® asm(D)wp- 


Here the Np-invariants Talg(D)N? is an irreducible algebraic representation of 
Mp, and the Np-coinvariants 7s,(D)y, is the usual Jacquet module in smooth 
representation theory (up to a twist). 

It is relatively easy to show that (2) is satisfied if BS() carries an invariant 
norm, which is how the condition arose in the first place. This takes care of the 
so-called “easy” direction of Conjecture 3.2. The “only if” part is still open; the 
existence of anorm on BS(p). The purpose of this note is to report on recent progress 
in this direction. Note that asking for a norm amounts to asking for a (separated) 
lattice: Given || - ||, look at the unit ball A. Conversely, given A, look at its “gauge” 
\|x|| = aguas where v (x) is the largest v such that x € mw, A. Thus we are looking 
for integral structures in BS(p). 
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3.1 Forerunner: The Crystalline Case 


For irreducible crystalline representations p : Ig, — GL2(E), the p-adic local 
Langlands correspondence for GL2(Q,) has a very nice description: Berger and 
Breuil showed that BS(p) admits a unique invariant norm, up to equivalence, and 
IT(p) is the completion. See [4], or Theorem 2.3.2 on p. 8 of [5] (which is about the 
existence of a finite type lattice; all such are obviously commensurable). 

In a precursor to [10], Schneider and Teitelbaum tried to extend this picture to 
higher-dimensional crystalline representations p : Ip — GL,(E). Here 75(p) 
is an unramified principal series representation, corresponding to a character of 
the spherical Hecke algebra, € : #(G,K) — E, where we have introduced 
G = GL,(F) and K = GL,(@,). Alternatively, one may then think of z,,(p) 
as a “universal module” (as in works of Borel, Matsumoto, Serre, Lazarus, etc.) 


Ism(p) = €-(G/K) @v(G.K).6 E, 


where @.(G/K) denotes the space of compactly supported E-valued continuous 
functions G/K (i.e, finitely supported functions since G/K is discrete). This can 
be generalized to take into account the algebraic representation 7qjg(p), which is 
traditionally denoted &. It defines a variant #(G, K) of the spherical Hecke algebra, 
consisting of End(&)-valued K-biequivariant functions on G. When € = 1 one 
recovers #(G, K). Since & is an irreducible representation of G (as opposed to 
just K), there is in fact a natural isomorphism #(G, K) = Hi(G,K) of E- 
algebras, which can be used to transfer ¢ to an eigensystem ¢ : #(G,K) —> E. 
See p. 639 in [35]. Unwinding the definitions, one finds that 


BS(p) ~ Hex := indg() @mGw. E, 


where ind? = c—Indg denotes the compact induction of the representation &|x (i.e., 
€-valued functions f on G such that f(kg) = &(k)f(g), and such that f is compactly 
supported modulo K). This interpretation is useful, by virtue of H;~ carrying a 
canonical G-invariant seminorm: Since K is compact, there is certainly a K-invariant 
norm || - ||; on &—and they are all equivalent since & is finite-dimensional. Thus 
ind? (&) is endowed with the sup-norm, and finally Hz ¢ acquires the quotient semi- 
norm. Since we mod out by a subspace which may not be closed, a priori it could 
happen that ||x|| = 0 for some x 4 0. A key construction in [35], p. 671, see also 
[34], is the following. 


Definition 3.4. B;~ is the Hausdorff completion of Hz ¢. That is, (H¢,¢/ {0t)*. 


Thus B¢¢ is a unitary Banach E-representation of G, which yields [7(p) when 
n = 2 and F = Q,). Unfortunately, in most other cases, it is not even known to be 
non-trivial! It would be bizarre, but we cannot rule out that 0 is dense in H; ¢. This 
led to the (weaker) predecessor of Conjecture 3.2: 
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Conjecture 3.5 (Schneider, Teitelbaum). Bz¢ A 0. 


For comparison, this is Conjecture 5.2 in [34]. Note that if BS(o) ~ Hg ¢ admits 
a G-invariant norm || - ||’, then in fact the canonical semi-norm on H¢¢ must be a 
norm (and consequently Hg ¢ <> Bg ¢ is dense): By Frobenius reciprocity, or really 
its “opposite’—which holds for compact induction, we have 


Homg (ind&(é), He) ~ Hom (€, Hg ¢|x), 


and all maps here are automatically continuous since dim(&) < oo, regardless of 
what topology we put on H¢ ¢. In particular the canonical projection z : ind? (§) > 
Hg ¢ is automatically continuous for the || - ||’-topology. Thus ker(z:) is closed. 

Although in general Bz is not even known to be nonzero, it is expected to be 
a huge inadmissible representation, with lots of (almost) quotients, for n > 2 or 
F # Q,. The expectation is that there should be a G-map, with dense image, 


we 
Bey — > IT(p), 


for any irreducible crystalline p with mtgig <> & and ms, <> ¢. There are scores 
of such representations in higher rank, corresponding to choices of admissible 
filtrations compatible with (€,¢). The motto here is that Bg~ parametrizes the 
crystalline part of p-adic local Langlands. 

There are other very important results in [35], but it would take us too far afield to 
describe them in detail here. Let it suffice to say they develop a (crystalline) theory 
for general split reductive groups G/;, and prove an analogue of Theorem 3.3 in that 
setup. See Theorem 5.5 in [34] for a nice summary. 


3.2 Miscellaneous Results 


There are various partial results towards Conjecture 3.2 scattered in the literature, 
all proved in a purely local setting. We give a quick (possibly incomplete) overview. 


¢ For GL2(F), de Jeso used compact induction in [18] to produce a separated lattice 
in BS(p), under some technical p-smallness condition on the weight. 

¢ For GL2(F), Assaf, Kazhdan, and de Shalit used p-adic Fourier theory for the 
Kirillov model to get integral structures in two cases: (1) 2sm(p) is unramified 
principal series, and zaig(e) is a twist of Sym" with n < qr, and (2) tsm(p) is 
tamely ramified and n = 0. See [1], and its prequel [30]. 

¢ For GL2(F), Vigneras studied local systems on the tree in [40], and was able to 
show Conjecture 3.2 in the Steinberg case (Proposition 0.9 in loc. cit.) and made 
progress for tamely ramified principal series (Proposition 0.10). 

¢ For general split reductive groups, Grosse-Klénne looked at the universal module 
for the spherical Hecke algebra, and was able to show cases of Conjecture 3.2 
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for unramified principal series, again under some p-smallness condition on 
the Coxeter number (when F = Q,) plus other technical assumptions. See 
Theorem 9.1 in [27]. 


4 The Indecomposable Case 


When s(p) is supercuspidal (equivalently, WD(p) is absolutely irreducible), there 
are many ways to construct a norm on BS(p). One can write 7,,,(o) as a compactly 
induced representation, or take the sup-norm of matrix coefficients. (This case was 
already observed as Theorem 5.2 in [10].) 

Note that the central character of BS(p) always takes values in OF. In fact 
this is equivalent to the Emerton condition (2) when WD(p) is indecomposable 
(equivalently, 75,,(o) is generalized Steinberg)—it turns out the Jacquet modules of 
Ism(p) are simple in that case, if nonzero, and as a consequence one only has to 
check (2) for P = G, which is p-adic unitarity of the central character. 

This leads to the following special case of Conjecture 3.2, which was mentioned 
explicitly as Conjecture 5.5 in [10]. 


Conjecture 4.1. Let p be a potentially semistable representation, with distinct 
Hodge-Tate weights, such that WD(p) is indecomposable. Then BS(p) admits a 
GL, (F)-invariant norm. 


The author recently proved this conjecture. In fact, he proved the following 
stronger result, which generalizes Conjecture 4.1 to arbitrary reductive groups [37]. 


Theorem 4.2 (S.). Let Gig, be a connected reductive group. Let Taig be an 
irreducible algebraic representation of G(Q,), and let msm be an essentially discrete 
series representation of G(Q,), both defined over E. Then Taig @£ Ksm admits a 
G(Q,)-invariant norm if (and only if) its central character is unitary. 


One deduces Conjecture 4.1 by taking G = Res g, GLy. Indeed the generalized 
Steinberg representations of GL,(F) are precisely the essentially discrete series 
representations—by which we mean some twist has L?-matrix coefficients. Here lies 
a subtlety worth pointing out: We are working over a p-adic field E, not C, so being 
L? only makes sense after choosing an embedding E <> C. To get a good notion 
over E, we need to know the set of essentially discrete series is stable under Aut(C). 
For GL,, this follows from the Bernstein-Zelevinsky classification (via segments of 
supercuspidals etc.)—see [31] for a nice overview. For a general G, it was shown by 
Clozel. 
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4.1 Sketch of the Proof of Theorem 4.2 


After fiddling around with the center, and using that the central character takes 
values in O7, one may assume G is semisimple. In turn, decomposing G* into a 
product of simple groups, and fiddling around with the tensor product norm (see 
paragraph 17, Chap. IV in [33]), one may assume G is in fact simple and simply 
connected. We are to show Taig @E Hsm always admits a norm—there is no center, 
hence no constraint. 


Step 1. We choose a global model G/g such that G(R) is compact. This uses a 
result of Borel and Harder in Galois cohomology [9], which shows that forms* 
(not necessarily inner forms) can be prescribed locally. 

Step 2. We apply the trace formula for G;g—which is as simple as the 
trace formula gets: There is no contribution from Eisenstein series since G 
is anisotropic. Since G(R) is compact, zig is discrete series. So is 7y,, by 
hypothesis. Therefore they admit pseudo-coefficients [14] (functions on the 
group which behave like matrix-coefficients). By a now more-or-less standard 
argument, which goes back to Clozel [15] (which in turn builds on [17])— 
in much greater generality, one can “isolate” the contribution from 7, and 
sm in the trace formula. The upshot being there exists /ots of automorphic 
representations IT of G(A) with IT,, = aig and IT, = asm (under some fixed 
and suppressed embedding E <> C). Here we use that G is simple to run the 
trace formula techniques. 

Step 3. Let &% be the space of automorphic forms on G(A). Following [26], 
for each compact open subgroup K C G(Ay), consider the space of algebraic 
modular forms on G, of weight aig and level K. That is, Homgyry (Zale. ag )— 
which contains Tif as a direct summand. Indeed, 


Homey (Taig, Le) ~ QB mo(s) «1p. 


IMoo=Talg 


Via E <> C this space gets identified with p-adic modular forms, 
Homx (Zig, 6G), where @G is the space of all continuous functions 
G(Q)\G(A;y) > E, and K acts via the projection to G(Q,). Thus, 

Talg ® nif alg ® Homg (Taig. 6a) = a 


When varying the level K,,, these maps are compatible, and give rise to a map 


(Zale &® Ty) &® ne = a. 


4Not to be confused with automorphic forms. For instance, the inner forms of GL(2) are the 
algebraic groups D™*, for quaternion algebras D, whereas the outer forms of GL(2) are the unitary 
groups in two variables. 
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Passing to the limit over shrinking K? yields mgig @ Hy = (alg ® Tsm) 
® IT? <> 6. The quotient G(Q)\G(Ay) is compact, so endowed with the 
sup-norm is a Banach space on which G(Ay) acts unitarily via right-translations. 
Restricting the sup-norm yields a G(A,)-invariant norm on Zig ® [;—and by 
further restriction a G(Q,)-invariant norm on qig®sm, Which finishes the sketch 
of the proof. 


Remark 2. To put things in perspective, @ can be thought of as AY, a very 
degenerate case of Emerton’s completed cohomology—developed in great gen- 
erality in [19]. Let H°(K)o, be the set of @g-valued functions on the finite set 
G(Q)\G(Ay)/K. When K > K’, there is a natural pull-back map H°(K)e, > 
H°(K’) o,- Fix a tame level K?, and take full levels of the form K = K,K?, with 
K, — 1. The direct limit Hie A °(K,K?) 6, is no longer wz-adically complete, so 


we consider its completion: 


H),,(K”) = (lim H"(K,K”)o,)*, A= lim Ho, (K”). 
Ky KP 


Thus AY. is the unit ball in a Banach E-space H®, which can be identified with 


E-valued continuous functions on G(Q)\G(A;). Alternatively, ee (K?) can be 
thought of as 


HG, (K°) := lim lim H°(KpK?) 6/0} 0¢- 
Kp s>0 


Indeed, the natural map Hy. (K?) > HY, (K?) is an isomorphism (see [19]). 


5 The Automorphic Case: Local-Global Compatibility at p 


The argument in Step 3 of Sect. 4.1 shows the following: If Gg is a reductive group, 
which is compact at infinity, and JT is an automorphic representation of G(A) with 
T]oq = Tatg and [Ty = sm, then Haig @E Hsm admits a G(Q,)-invariant norm. 

We will apply this observation to definite unitary groups: Thus> F /F Tt isa 
CM extension, and B is a central simple F-algebra of dimension n? equipped 
with an anti-involution * of the second kind (which means «|; = c). Introduce 
U = U(B,x) Jet and take its restriction of scalars, G = Resz+ jq(¥). We assume 


throughout that G(R) ~ U (n)Hom * 2) ig compact, and that G(Q,) can be identified 
with [,),, GLn(F) upon a choice of divisor t|v, for each place v|p of FT. 


v|p 


5We use the notation F to distinguish it from our p-adic base field F/ Q. 


230 C.M. Sorensen 


One of the goals of the “Book Project” is to attach Galois representations to 
automorphic representations [7 on G(A)—roughly by a base change to GL, (Aj), 
descent to an indefinite unitary group U’ with a Shimura variety Shy’, and a detailed 
study of the cohomology of Shy. The following result is now known, due to the 
collaborative efforts of many people. 


Theorem 5.1 (“Book Project”). Let IT be an automorphic representation of G(A), 
with [log of weight a = (dz) cyom(#+.R)- Fix an isomorphism tC — Q). Then 
there exists a unique continuous semisimple Galois representation, 


= Gal(Q/F) _— GL, (Q,), 


with the following properties: 


w 1 
(0) ra, = 1, ® Eye 


(1) For every finite place v of E+, which splits in F, and every place w\v, 
WD(rul ry, )P & recn(BC yy (To) ® | det |"). 


(Here BC,, denotes “base change” from U(FT) to GL,(Fy).) 
(2) rrulr;,, 8 potentially semistable for every w|p, with Hodge-Tate weights 


AT (r7y\r;,,) = {arj + (n —j) ‘J = 1, +: ny}, 


fort: Fy oO Q,. Here our convention is that HT; (€cyc) = {-l}. 


Remark 3. In this Theorem, rec, denotes the (classical) local Langlands corre- 
spondence for GL, (F, w), hormalized as in [28]. It’s a bijection between irreducible 
admissible representations of GL, (F w) (over C Q) and Frobenius-semisimple 
n-dimensional Weil-Deligne representations for F,,. The latter are pairs (r, N) 
consisting of a (nilpotent) monodromy operator N and a semisimple representation 
r of the Weil group Wj, , with open kernel. They naturally arise from potentially 
semistable p-adic representations of Ij . When w + p, every continuous represen- 
tation is potentially semistable, and (r, N ) is given by a simple formula (see [39] for 
details). For w|p the associated (r, N) is more complicated to define, and involves 
Fontaine’s period ring Byr (see [24] for details). The Weil-representation r may 
not be semisimple; the Frobenius-semisimplification is defined to be (r, N)’—*S = 
(7°, N). 


What is important to us here is the “local-global compatibility” in part (1) at 
the places v dividing p. This was proved in [2] and [3] under a certain regularity 
assumption, which was later removed in [12]. 

In [38] the author pushed the ideas from Sect. 4.1 further, combining them with 
Theorem 5.1, part (1) above p. A bit of book-keeping yielded the result below. 
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Corollary 5.2 (S). Conjecture 3.2 holds for p = rru\|r;_, for any place w|p of F. 


The implied norm on BS(p) comes from a G(Q,)-equivariant embedding 


&)BS(rail7,) Ge = {GQ\G(Ay) > Q}. (3) 
|p 


obtained as in Sect. 4.1. Here the tensor product is viewed as a representation of 
G(Q,) via the selection of places {v},),. The speculation aired in [38] is that the 
closure of the image of (3) ought to be “closely related” to ®,),/7(r77,.| Tp. ), where 
IT(-) should be the p-adic local Langlands correspondence for GL,, if it exists. 
(At least if all the restrictions ry,| rz_ are irreducible.) 

One can make this more precise: Given an automorphic representation JT as in 
Theorem 5.1, there is the associated Galois representation rj, of Ij, which we 
can restrict to the various decomposition groups at p—resulting in the collection 
{rr Tz, Swlp: On the other hand, the argument in Sect.4 (“Step 3”) yields a 
canonical G(A,)-invariant norm on (aig ® Hsm) ® I 7 —using the now known result 
that mg(/T) = 1. For each choice of vector 0 x x € ITP, this gives rise to a 
G(Q,)-invariant norm || - ||, On aig ® sm, by restriction. Since TT; is irreducible, 
it is easy to see that all these || - ||,, define the same topology. Thus we may also 
attach to IT (and 1) a unitary Banach space representation of G(Q,), namely the 
completion Br, = (Taig ® ism)’. It is natural to speculate that By, only depends 
on {r7, Te, hel p- That is, for two automorphic J7 and IT ’ as above, it is natural to ask 
whether or not 


; 
ralrz, ~ Toulrz,» Vwip => Bn. = Bru, 


and perhaps even conversely—under favorable circumstances. This seems to be 
a very hard question, which lies at the very heart of extending the p-adic local 
Langlands correspondence to GL,. 

Another speculation in [38] was how these things vary across the eigenvariety 
X = Xg, of some fixed tame level K?. On one hand, each point x € X(E) defines 
a pseudo-representation t, : Ij — E. On the other, one has the Hecke eigenspace 
er, which is a Banach-Hecke module over E with a unitary G(Q,)-action. This 
way X parametrizes a bijection 4, <> Go. which we like to think of as some sort of 
p-adic “global” Langlands correspondence. 


6 Patching and p-Adic Local Langlands 


Recently there has been spectacular progress on Conjecture 3.2 in the principal 
series case, which is the deepest, by joint work of Caraiani, Emerton, Gee, Geraghty, 
Paskiinas, and Shin. In the preprint [13], using global methods, they construct a 
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candidate IT(p) for a p-adic local Langlands correspondence for GL, (F), and are 
able to say enough about it to prove new cases of Conjecture 3.2. The following 
is their Theorem 5.3 (a less precise version of which is Theorem B in their 
introduction). 


Theorem 6.1 (Caraiani, Emerton, Gee, Geraghty, PaSkunas, and Shin). Sup- 
pose p t+ 2n. Let p : Ip —> GL,(E) be a generic potentially crystalline 
representation, of regular weight. If p lies on an automorphic component of the 
corresponding potentially crystalline deformation ring R5 (o)[1/p], then BS(p) 
admits a nonzero unitary admissible Banach completion. 


The conclusion is a little stronger than just the existence of a norm on BS(p), 
in that it asserts admissibility. They refer to it as “folklore” that every regular 
de Rham representation p should lie on an automorphic component. Thus they 
reduce Conjecture 3.2 to standard expectations in the automorphy lifting world. 
It should be emphasized that this condition is much weaker than saying p comes 
from an automorphic form (as in Corollary 5.2). It roughly says that it lies on the 
same component as some ry,| Te (identifying F; ~ F)—in turn more restrictive 


than saying the reduction p is automorphic. In some situations one can check this 
hypothesis, employing the available automorphy lifting theorems. See Corollary 5.4 
on “potentially diagonalizable” p in [13], and their Corollary 5.5—which we quote: 


Corollary 6.2 (—). Suppose p > 2. Let p : 'y — GL,(E) be a generic potentially 
semistable representation, of regular weight. Further, assume that either 


(1) n = 2, and p is potentially Barsotti-Tate, or 

(2) F/Q, is unramified, p is crystalline, p # n, and the Hodge-Tate weights of p 
are in the “extended Fontaine-Laffaille range” (meaning the difference of any 
two weights in HT, (p) is at most p — 1). 


Then BS(p) admits a nonzero unitary admissible Banach completion. 


As hinted at already, they actually construct a candidate IT() for p-adic local 
Langlands and verify (under the given hypotheses) that BS(p) <> IT(p). Moreover, 
they express optimism in [13], Remark 5.8, that BS() coincides with the space of 
locally algebraic vectors IT(p)“%—evidence that IT(p) is “purely local” although 
it is defined via choices of auxiliary global data. Convincing evidence for Shimura 
curves is provided by Emerton et al. [22]. 

The definition of [7(e) in [13] is based on their “hypothetical formulation” of 
p-adic local Langlands in Sect. 6.1: Given a representation p : ry — GL,(kg) with 
Endp,(0) = ke (for simplicity) there should be a finitely generated R5[[GL,(@)]]- 
module Loy, which is @z-torsion free, and such that the GL,(@)-action can be 
promoted to an R;-linear action of the full GL, (F). Then p-adic local Langlands 
p »> B(p) should arise from specializing Loo as follows. Say p : Ir > GLn(@z) is 
a lift of 6, corresponding to a point x € (SpecR;)(@z). Then, 
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B(p) = (Loo @rzx Gx)"[1/p], 


where d denotes taking the (Schikhof) dual® Hom, (—, Gz). When Endr,(p) 4 kr 
one would replace R; by the universal framed deformation ring R=’, and Log should 
be endowed with additional structure (see the footnote on p. 45 of [13]). 

The construction in [13] starts off from a p : Ip — GL,(kz), which admits 
a “potentially diagonalizable” lift p,4, to Gg. This assumptions allows them to 
pass to a global setup (cf. Sect. 5 above), and suitably extend p to a representation 
I, > GLy(ke). Carrying out a delicate variant of the Taylor-Wiles-Kisin patching 
method for modular forms on G, but letting the weight and the p-part of the level 
vary, they construct an Roo[[GL,(@)]|-module Moo (defined on p. 15 in loc. cit.), 
where Roo is a multivariate power series ring over some local deformation ring R!° 
(rather, a tensor product of such). Proposition 2.8 in loc. cit. establishes the key fact 
that the GL,,(@)-action on Mg extends naturally to a GL, (F)-action. 

Now, if p : Ty — GL,(@z) is a lift of p, the definition of [7(p) proceeds in 
the following steps: Under the identification F ~ F; our lift p corresponds to a 
point x : Ro > OG, which we extend to a point x’ : R!’° > Gz, using the given 
lift Ppa, at the places in S, away from p. We extend x’ further, and arbitrarily, to a 
homomorphism y : Roo > Ge. Recall that Roo X R!°[[x1,.... xy]. Then, 


TT(p) := (Moo @kooy Gr)"[1/p].- 


(See Sect. 2.10, p. 19 in [13].) It should be stressed that I7(p) is not known to be 
independent of the “globalization”, nor the choice of extension y. Furthermore, in 
general it is not known whether J7(p) £ 0. 


In Sect. 6.2 of [13] they speculate that Moo 7 Roo @r; Leo: 
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